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November 13, 2003, with a size of 78,451 kb; Table_14.txt, date of creation 

20 November 13, 2003, with a size of 454 kb; and Table_15.txt date of creation 
November 13, 2003, with a size of 1,690 kb. 

Field of Invention 

The present invention relates to RNA interference ("Ri^Ai"). . 

25 

Background of the Invention 

Relatively recently, researchers observed that double stranded RNA 
("dsRNA") could be used to inhibit protein expression. This ability to silence a gene 
has broad potential for treating human diseases, and many researchers and 
30 commercial entities are currently investing considerable resources in developing 
therapies based on this technology. 

Double stranded RNA induced gene silencing can occur on at least three 
different levels: (i) transcription inactivation, which refers to RNA guided DNA or 



histone methylation; (ii) siRNA induced mRNA degradation; and (iii) mRNA induced 
transcriptional attenuation. 

It is generally considered that the major mechanism of RNA induced silencing 
5 (RNA interference, or RNAi) in mammalian cells is mRNA degradation. Initial 
attempts to use RNAi in mammalian cells focused on the use of long strands of 
dsRNA. However, these attempts to induce RNAi met with limited success, due in 
part to the induction of the interferon response, which results in a general, as opposed 
to a target-specific, inhibition of protein synthesis. Thus, long dsRNA is not a viable 
10 option for RNAi in mammalian systems. 

More recently it has been shown that when short (18-30 bp) RNA" duplexes are 
introduced into mammalian cells in culture, sequence-specific inhibition of target 
mRNA can be realized without inducing an interferon response. Certain of these 

15 short dsRNAs, referred to as small inhibitory RNAs ("siRNAs"), can act catalytically 
at sub-molar concentrations to cleave greater than 95% of the target mRNA in the 
cell. A description of the mechanisms for siRNA activity, as well as some of its 
applications are described in Provost et aL, Ribonuclease Activity and RNA Binding of 
Recombinant Human Dicer, E.M.B.O. J., 2002 Nov. 1; 21(21): 5864-5874; Tabara et 

20 a/.. The dsRNA Binding Protein RDE-4 Interacts with RDE-L DCR-1 and a DexH- 

boxHelicase to Direct RNAi in C. elegans. Cell 2002, June 28;109(7):861-71; Ketting 
et al. Dicer Functions in RNA Interference and in Synthesis of Small RNA Involved in 
Developmental Timing in C. elegans; Martinez et aL, Single-Stranded Antisense 
siRNAs Guide Target RNA Cleavage in RNAi, Cell 2002, Sept. 6; 110(5):563; 

25 Hutvagner & Zamore, A microRNA in a multiple-turnover RNAi enzyme complex. 
Science 2002, 297:2056. 

From a mechanistic perspective, introduction of long double stranded RNA 
into plants and invertebrate cells is broken down into siRNA by a Type III 
30 endonuclease known as Dicer. Sharp, RNA interference — 2001, Genes Dev. 2001, 
15:485. Dicer, a ribonuclease-III-like enzyme, processes the dsRNA into 19-23 base 
pair short interfering RNAs with characteristic two base 3' overhangs. Bemstein, 
Caudy, Hammond, & Hannon, Role for a bidentate ribonuclease in the initiation step 
of RNA interference. Nature 2001, 409:363. The siRNAs are then incorporated into 

2 
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an RNA-induced silencing complex (RISC) where one or more helicases unwind the 
siRNA duplex, enabling the complementary antisense strand to guide target 
recognition. Nykanen, Haley, & Zamore, ATP requirements and small interfering 
UNA structure in the UNA interference pathway. Cell 2001, 107:309. Upon binding to 
5 the appropriate target mRNA, one or more endonucleases within the RISC cleaves the 
target to induce silencing. Elbashir, Lendeckel, & Tuschl, RNA interference is 
mediated by 21- and 22'nucleotide RNAs, Genes Dev 2001, 15:188, Figure 1. 

The interference effect can be long lasting and may be detectable after many 
10 cell divisions. Moreover, RNAi exhibits sequence specificity. Kisielpw, M. et aL 
(2002) Isoform-specific knockdown and expression of adaptor protein ShcA using 
small interfering RNA, J. of Biochemistry 363: 1-5. Thus, the RNAi machinery can 
specifically knock down one type of transcript, while not affecting closely related 
mRNA. These properties make siRNA a potentially valuable tool for inhibiting gene 
1 5 expression and studying gene fiinction and drug target validation. Moreover, siRNAs 
are potentially usefiil as therapeutic agents against: (1) diseases that are caused by 
over-expression or misexpression of genes; and (2) diseases brought about by 
expression of genes that contain mutations. 

20 Successfiil siRNA-dependent gene silencing depends on a number of factors. 

One of the most contentious issues in RNAi is the question of the necessity of siRNA 
design, i.e., considering the sequence of the siRNA used. Early work in C. elegans 
and plants circumvented the issue of design by introducing long dsRNA (see, for - 
instance. Fire, A. et aL (1998) Nature 391 : 806-81 1). In this primitive organism, long 

25 dsRNA molecules are cleaved into siRNA by Dicer, thus generating a diverse 

population of duplexes that can potentially cover the entire transcript. While some 
fi*action of these molecules are non-fiinctional {i.e, induce little or no silencing) one or 
more have the potential to be highly functional, thereby silencing the gene of interest 
and alleviating the need for siRNA design. Unfortunately, due to the interferon 

30 response, this same approach is vmavailable for mammalian systems. While this effect 
can be circumvented by bypassing the Dicer cleavage step and directly introducing 
siRNA, this tactic carries with it the risk that the chosen siRNA sequence may be non- 
fimctional or semi-fimctional. 
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A number of researches have expressed the view that siRNA design is not a 
crucial element of RNAi. On the other hand, others in the field have begun to explore 
the possibility that RNAi can be made more efficient by paying attention to the design 
of the siRNA. Unfortunately, none of the reported methods have provided a 
5 satisfactory scheme for reliably selecting siRNA with acceptable levels of 

fiinctionality. Accordingly, there is a need to develop rational criteria by which to 
select siRNA with an acceptable level of functionality, and to identify siRNA that 
have this improved level of fimctionality, as well as to identify siRNAs that are 
hyperfunctional. 

10 

Summary of the Invention 

The present invention is directed to increasing the efficiency of RNAi, ' 
particularly in mammalian systems. Accordingly, the present invention provides kits, 
siRNAs and methods for increasing siRNA efficacy. 

15 

According to one embodiment, the present invention provides a kit for gene 
silencing, wherein said kit is comprised of a pool of at least two siRNA duplexes, 
each of which is comprised of a sequence that is complementary to a portion of the 
sequence of one or more target messenger RNA. 

20 

According to a second embodiment, the present invention provides a method 
for optimizing RNA interference by using one or more siRNAs that are optimized 
according to a formula (or algorithm) selected from: 
Formula I 

25 Relafive fiinctionality of siRNA= -(GC/3) +(AUi5.i9) -(Tm2o-c)*3 -(Gi3)*3 -(C19) 
+(A,9)*2 +(A3) +(Uio)+(Ai4) -(Us) ^A, 1) 

Formula II 

Relative functionality of siRNA= -(GC/3) -(AUi5-i9)*3 -(Gi3)*3 -(C19) +(Ai9)*2 
30 +(A3) 

Formula III 

Relative functionality of siRNA= -(GC/3) +(AUi5-i9) ^Tm2ooc)*3 



Formula IV 

Relative functionality of siRNA= 

.GC/2+( AUi5-i9)/2-(Tm2ooc)*2 -(Gi3)*3 -(0,9) +(A,9)*2 +(A3) +(U,o)+(A,4) -(U5) - 
(An) 

5 

Formula V 

Relative functionality of siRNA=-(Gi3)*3 -(C19) +(Ai9)*2 +(A3) + (U|o)+(Ai4) -(U5) 

10 Formula VI 

Relative functionality of siRNA=-(Gi3)*3 -(C19) +(Ai9)*2 +(A3) 

Formula VII 

Relative functionality of siRNA--(GC/2) +(AUi5.i9)/2 -( Tm2o°c)*l -(Gi3)*3 -(C19) 
1 5 +(A,9)*3 +(A3)*3 +(Uio)/2+(Ai4)/2 ^U5)/2 -(Ai i)/2 



wherein in Formulas I - VII: 

Tm 2000=1 if the Tm is greater than 20°C; 
20 Ai9 = 1 if A is the base at position 19 on the sense strand, otherwise its value 



is 0; 



0; 
0; 



0; 
0; 



AU15.19 = 0-5 depending on the number of A or U bases on the sense strand 



at -L^-*.- 

positions 15-19; 

25 Gi3 = 1 if G is the base at position 13 on the sense strand, otherwise its value is 



Ci9 = 1 if C is the base at position 19 of the sense strand, otherwise its value is 



GC= the number of G and C bases in the entire sense strand; 
30 A3= 1 if A is the base at position 3 on the sense strand, otherwise its value is 0; 

Aii= 1 if A is the base at position 1 1 on the sense strand, otherwise its value is 



A 14= 1 if A is the base at position 14 on the sense strand, otherwise its value is 



iiS. il.- '2. . 
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Uio= 1 if U is the base at position 10 on the sense strand, otherwise its value is 

0; 

U5= 1 if U is the base at position 5 on the sense strand, otherwise its value is 0; 

or 

5 

Formula VIII Relative functionality of siRNA = 

(-14)*Gi3-13*Arl2*U7-ll*U2-10*Aii-10*U4-10*C3-10*C5-10* 
9*A,o-9*U9-9*C,8-8*Gio-7*Ui-7*U,6-7*Ci7-7*Ci9 
+7*Ui7+8*A2+8*A4 +8*A5+8*C4 +9*G8 +10*A7+10*Ui8+l 1 *A,9+ 
10 ll*C9+15*Gi+ 18*A3+19*Uio-Tm-3* (GQotai) - 6*(GCl5-l9)- 

30*X; and 

Formula IX Relative functionality of siRNA = 

(14.1)*A3+(14.9)*A6-H(17.6)*A,3+(24.7)*A,9+(14.2)*Uio+(10,5)* 
15 C9+(23.9)*Gi+(16.3)*G2+(-12,3)*A,,+(-19.3)*Ui4-(-12,l)*U2+ 
(-ll)*U3+(-15.2)*U,5+(-11.3)*Ui6+(-11.8)*C3+(-17.4)*C6+(- 
10.5)*C7+ (-13.7)*G,3+(-25.9)*G,9-Tm-3* (GQotai) - 6*(GC,5-i9)- 
30*X 

wherein 

20 Ai = 1 if A is the base at position 1 of the sense strand,^therwise its value is 0; 

A2 = 1 if A is the base at position 2 of the sense strand, otherwise its value is 0; 

A3 = 1 if A is the base at position 3 of the sense strand, otherwise its value is 0; 

A4 = 1 if A.is the base at position 4 of the sense strand, otherwise its value is 0; - . i 

A5 = 1 if A is the base at position 5 of the sense strand, otherwise its value is 0; 
25 Ae = 1 if A is the base at position 6 of the sense strand, otherwise its value is 0; 

A7 = 1 if A is the base at position 7 of the sense strand, otherwise its value is 0; 

Aio= 1 if A is the base at position 10 of the sense strand, otherwise its value is 0; 

All = 1 if A is the base at position 1 1 of the sense strand, otherwise its value is 0; 

A 13 = 1 if A is the base at position 13 of the sense strand, otherwise its value is 0; 
30 Ai9 = 1 if A is the base at position 19 of the sense strand, otherwise if another base is 
present or the sense strand is only 18 base pairs in length, its value is 0; 

C3 = 1 if C is the base at position 3 of the sense strand, otherwise its value is 0; 
C4 = 1 if C is the base at position 4 of the sense strand, otherwise its value is 0; 
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Cs = 1 if C is the base at position 5 of the sense strand, otherwise its value is 0; 
C6 = 1 if C is the base at position 6 of the sense strand, otherwise its value is 0; 
C7 = 1 if C is the base at position 7 of the sense strand, otherwise its value is 0; 
C9 = 1 if C is the base at position 9 of the sense strand, otherwise its value is 0; 
. 5 Ci7 = 1 if C is the base at position 17 of the sense strand, otherwise its value is 0; 
C18 = 1 if C is the base at position 18 of the sense strand, otherwise its value is 0; 
C 19 = 1 if C is the base at position 19 of the sense strand, otherwise if another base is 
present or the sense strand is only 18 base pairs in length, its value is 0; 

10 Gi = 1 if G is the base at position 1 on the sense strand, otherwise its value is 0; 

G2 = 1 if G is the base at position 2 of the sense strand, otherwise its value is 0; 

Gg = 1 if G is the base at position 8 on the sense strand, otherwise its value is 0; 

Gio= 1 if G is the base at position 10 on the sense strand, otherwise its value is 0; 

Gi3 = 1 if G is the base at position 13 on the sense stremd, otherwise its value is 0; 
15 Gi9=lifGis the base at position 19 of the sense strand, otherwise if another base is 
present or the sense strand is only 18 base pairs in length, its value is 0; 

Ui = 1 if U is the base at position 1 on the sense strand, otherwise its value is 0; 

U2 = 1 if U is the base at position 2 on the sense strand, otherwise its value is 0; 
20 U3 = 1 if U is the base at position 3 on the sense strand, otherwise its value is 0; 

U4 = 1 if U is the base at position 4 on the sense strand, otherwise its value is 0; 

U7 = 1 if U is the base at position 7 on the sense strand, otherwise its value is 0; 

U9 =-l'if Uiisrthe base at position 9 on the sense strand, otherwise its value is 0; 

Uio = 1 if U is the base at position 10 on the sense strand, otherwise its value is 0; 
25 Ui5 = 1 if U is the base at position 15 on the sense strand, otherwise its value is 0; 

U16 = 1 if U is the base at position 16 on the sense strand, otherwise its value is 0; 

Ui7 = 1 if U is the base at position 17 on the sense strand, otherwise its value is 0; 

Ui8 = 1 if U is the base at position 18 on the sense strand, otherwise its value is 0; 

30 GC15-19 = the number of G and C bases within positions 15 - 19 of the sense strand 
or within positions 1 5 -18 if the sense strand is only 1 8 base pairs in length; 
GCtota! = the number of G and C beises in the sense strand; 

Tm = 100 if the targeting site contains an inverted repeat longer than 4 base pairs, 
otherwise its value is 0; and 
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X = the number of times that the same nucleotide repeats four or more times in a row. 
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15 



According to a third embodiment, the present invention is directed to a kit 
comprised of at least one siRNA that contains a sequence that is optimized according 
to one of the formulas above. Preferably the kit contains at least two optimized 
siRNA, each of which comprises a duplex, wherein one strand of each duplex 
comprises at least eighteen contiguous bases that are complementary to a region of a 
target messenger RNA. For mammalian systems, the siRNA preferably comprises 
between 18 and 30 nucleotide base pairs. 

The ability to use the above algorithms, which are not sequence or species 
specific, allows for the cost-effective selection of optimized siRNAs for specific 
target sequences. Accordingly, there will be both greater efficiency and reliability in 
the use of siRNA technologies. 



According to a fovirth embodiment, the present invention provides a method 
for developing an siRNA algorithm for selecting fiinctional and hyperfiuictional 
siRNAs for a given sequence. The method comprises: 
(a) selecting a set of siRNAs; 
20 (b) measuring the gene silencing ability of each siRNA^om said set; 

(c) determining the relative fimctionality of each siRNA; 

(d) determining the amount of improved functionality by the presence or 
c absence of at least one variable selected fi-om the group consisting of 

the total GC content, melting temperature of the siRNA, GC content at 
25 positions 15-19, the presence or absence of a particular nucleotide at a 

particular position and the number of times that the same nucleotide 
repeats within a given sequence; and 

(e) developing an algorithm using the information of step (d). 

30 - According to this embodiment, preferably the set of siRNAs comprises at least 

90 siRNAs fi-om at least one gene, more preferably at least 180 siRNAs firom at least 
two different genes, and most preferably at least 270 and 360 siRNAs fi-om at least 
three and four different genes, respectively. Additionally, in step (d) the 
determination is made with preferably at least two, more preferably at least three. 
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even more preferably at least four, and most preferably all of the variables. The 
resulting algorithm is not target sequence specific. 



In a fifth embodiment, the present invention provides rationally designed 
5 siRNAs identified using the formulas above. 

In a sixth embodiment, the present invention is directed to hyperfiinctional 

siRNA. 

10 For a better understanding of the present invention together with other and 

further advantages and embodiments, reference is made to the following description 
taken in conjunction with the examples, the scope of which is set forth in the 
appended claims. 

15 Brief Description of the Figures 

Figure 1 shows a model for siRNA-RISC interactions. RISC has the ability to interact 
with either end of the siRNA or miRNA molecule. Following binding, the duplex is 
unwound, and the relevant target is identified, cleaved, and released. 

20 Figure 2 is a representation of the fiinctionality of two hundred and seventy siRNA 
duplexes that were generated to target hxmian cyclophilin, human diazepam-binding 
inhibitor (DB), and firefly luciferase. 

Figure 3a is a representation of the silencing effect of 30 siRNAs in three different 
25 cells lines, HEK293, DU145, and Hela. Figure 3b shows the fi-equency of different 
fiinctional groups (>95% silencing (black), >80% silencing (gray), >50% silencing 
(dark gray), and <50% silencing (white)) based on GC content. In cases where a given 
bar is absent fi-om a particular GC percentage, no siRNA were identified for that 
particular group. Figure 3c shows the firequency of different fiinctional groups based 
30 on melting temperature (Tm). Again, each group has four different divisions: >95% 
(black), >80% (gray), >50% (dark gray), and <50% (white) silencing. 

Figure 4 is a representation of a statisticed analysis that revealed correlations between 
silencing and five sequence-related properties of siRNA: (A) an A at position 19 of 
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the sense strand, (B) an A at position 3 of the sense strand, (C) a U at position 10 of 
the sense strand, (D) a base other than G at position 13 of the sense strand, and (E) a 
base other than C at position 19 of the sense strand. All variables were correlated with 
siRNA silencing of firefly luciferase and human cyclophilin. SiRNAs satisfying the 
5 criterion are grouped on the left (Selected) while those that do not, are grouped on the 
right (Eliminated). Y-axis is "% Silencing of Control." Each position on the X-axis 
represents a unique siRNA. 

Figures 5 A and 5 B are representations of firefly luciferase and cyclophilin siRNA 
10 panels sorted according to functionality and predicted values using Formula VIII. The 
siRNA found within the circle represent those that have Formula VIII values 
(SMARTscores'^^) above zero. SiRNA outside the indicated area have calculated 
Formula VIII values that are below zero. Y-eixis is "Expression (% Control)." Each 
position on the X-axis represents a unique siRNA. 

15 

Figure 6A is a representation of the average internal stability profile (AISP) derived 
firom 270 siRNAs taken fi-om three separate genes (cyclophilin B, DBI and firefly 
luciferase). Graphs represent AISP values of highly fiinctional, functional, and non- 
fiinctional siRNA. Figure 6B is a comparison between the AISP of naturally derived 
20 GFP siRNA (filled squares) and the AISP of siRNA firom cyclophilin B,«DBI, and 

luciferase having >90% silencing properties (no fill) for the antisense strand. "DO" is 
the symbol for AG, fi-ee energy. 

Figure 7 is a histogram showing the differences in duplex functionality upon 
25 introduction of basepair mismatches. The X-axis shows the mismatch introduced inot 
the siRNA and the position it is introduced (e,g,, 8C->A reveals that position 8 (which 
normally has a C) has been changed to an A). The Y-axis is "% Silencing 
(Normalized to Control)." 

30 Figure 8a is histogram that shows the effects of 5 'sense and antisense strand 

modification with 2'-0-methylation on functionality. Figure 8b is an expression 
profile showing a comparison of sense strand off-target effects for IGFlR-3 and 2'-0- 



10 



methyl IGFlR-3. Sense strand off-targets (lower white box) are not induced when the 
5' end of the sense strand is modified with 2'-0-methyl groups (top white box). 



Figure 9 shows a graph of SMARTscores'^^ versus RNAi silencing values for more 
5 than 360 siRNA directed against 30 different genes. SiRNA to the right of the vertical 
bar represent those siRNA that have desirable SMARTscores™. 

Figures lOA - E compare the RNAi of five different genes (SEAP, DBI, PLK, 
Firefly Luciferase, and Renila Luciferase) by varying numbers of randomly selected 

10 siRNA and four rationally designed (SMART-selected) siRNA chosen using the 

algorithm described in Formula VIII. In addition, RNAi induced by a pool of the four 
SMART-selected siRNA is reported at two different concentrations (ICQ and 400nM). 
lOF is a comparison between a pool of randomly selected EGFR siRNA (Pool 1) and 
a pool of SMART selected EGFR siRNA (Pool 2). Pool 1, SI — S4 and Pool 2 SI — 

15 S4 represent the individual members that made up each respective pool. Note that 

numbers for random siRNAs represent the position of the 5' end of the sense strand of 
the duplex. The Y-axis represents the % expression of the control(s). The X-axis is 
the percent expression of the control. 

20 Figure 11 shows the Westem blot results firom cells treated with siRNA directed 
against twelve different genes involved in the clathrin-dependent endocytosis 
pathway (CHC, Dynll, CALM, CLCa, CLCb, Epsl5, EpslSR, RabSa, RabSb, RabSc, 
p2 subunit of AP-2 and EEA-l^. SiRNA were selected using Formula VIII. "Pool" 
represents a mixture of duplexes 1-4. Total concentration of each siRNA in the pool 

25 is 25 nM, Total concentration = 4 x 25 = 100 nM. 

Figure 12 is a representation of the gene silencing capabilities of rationally-selected 
siRNA directed against ten different genes (human and mouse cyclophilin, C-myc, 
human lamin A/C, QB (ubiquinol-cytochrome c reductase core protein I), MEKl and 
30 MEK2, ATEl (arginyl-tRNA protein transferase), GAPDH, and Eg5). The Y-axis is 
the percent expression of the control. Numbers 1, 2, 3 and 4 represent individual 
rationally selected siRNA. "Pool" represents a mixture of the four individual siRNA. 
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Figure 13 is the sequence of the top ten Bcl2 siRNAs as determined by Formula VIII. 
Sequences are hsted 5' to 3'. 



Figure 14 is the knockdown by the top ten Bcl2 siRNAs at lOOnM concentrations. 
5 The Y-axis represents the amount of expression relative to the non-specific (ns) and 
transfection mixture control. 

Figure 15 represents a functional walk where siRNA beginning on every other base 
pair of a region of the luciferase gene are tested for the ability to silence the luciferase 
10 gene. The Y-axis represents the percent expression relative to a control. The X-axis 
represents the position of each individual siRNA. 

Figure 16 is a histogram demonstrating the inhibition of target gene expression by 
pools of 2 and 3 siRNAs duplexes taken firom the walk described in Figure 1 5. The 
15 Y-axis represents the percent expression relative to control. The X-axis represents the 
position of the first siRNA in paired pools, or trios of siRNA. For instance, the first 
paired pool contains siRNA 1 and 3. The second paired pool contains siRNA 3 and 5. 
Pool 3 (of paired pools) contains siRNA 5 and 7, and so on. 

20 Figure 17 is a histogram demonstrating the inhibition of target gene expression by 
pools of 4 and 5 siRNA duplexes. The Y-axis represents the percent expression 
relative to a control. The X-axis represents the positoin of the first siRNA in each 
pool . >, « , 

25 Figure 18 is a histogram demonstrating the inhibition of target gene expression by 
siRNAs that are ten and twenty basepairs apart. The Y-axis represents the percent 
expression relative to a control. The X-axis represents the position of the first siRNA 
in each pool. 

30 Figure 19 shows thatpools of siRNAs (dark gray bar) work as well (or better) than 
the best siRNA in the pool (light gray bar). The Y-axis represents the percent 
expression relative to a control. The-X axis represents the position of the first siRNA 
in each pool. 
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Figure 20 shows that the combination of several semifunctional siRNAs (dark gray) 
result in a significant improvement of gene expression inhibition over individual 
(semi-functional; light gray) siRNA. The Y-axis represents the percent expression 
relative to a control. 

5 

Figure 21 shows both pools (Library, Lib) and individual siRNAs in inhibition of 
gene expression of Beta-Galactosidase, Renilla Luciferase and SEAP (alkaline 
phosphatase). Numbers on the X-axis indicate the position of the 5' -most nucleotide 
of the sense strand of the duplex. The Y-axis represents the percent expression of 

10 each gene relative to a control. Libraries contain siRNAs that begin at the following 
nucleotides: Seap: Lib 1: 206, 766, 812,923, Lib 2: 1117, 1280, 1300, 1487, Lib 3: 
206,766,812,923, 1117, 1280, 1300,1487, Lib 4: 206,812, 1117, 1300, Lib 5: 766, 
923, 1280, 1487, Lib 6: 206, 1487; Bgal: Lib 1: 979, 1339, 2029, 2590, Lib 2: 
1087,1783,2399,3257, Lib 3: 979, 1783, 2590, 3257, Lib 4: 979, 1087, 1339, 1783, 

15 2029, 2399,2590,3257, Lib 5: 979, 1087, 1339, 1783, Lib 6: 2029,2399,2590,3257; 
Renilla: Lib 1: 174,300,432,568, Lib 2: 592, 633, 729,867, Lib 3: 174,300,432, 
568, 592, 633,729,867, Lib 4: 174,432,592,729, Lib 5: 300,568,633,867, Lib 6: 
592,568. 

Figure 22 showS the results of an EGFR and TfhR internalization assay when single 
gene knockdowns are performed. The Y-axis represents percent intemalization 
relative to control. - - ~rz^ 

25 Figure 23 shows the results of an EGFR and TftiR intemalization assay when 
multiple genes are knocked down (e.g. Rab5a, b, c). The Y-axis represents the 
percent intemalization relative to control. 

Figure 24 shows the simultaneous knockdown of four different genes. SiRNAs 
30 directed against G6PD, GAPDH, PLK, and UBQ were simultaneously introduced into 
cells. Twenty- four hours later, cultures were harvested and assayed for mRNA target 
levels for each of the four genes. A comparison is made between cells transfected 
with individual siRNAs vs. a pool of siRNAs directed against all four genes. 
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Figure 25 shows the functionaUty of ten siRNAs at 0.3nM concentrations. 

Detailed Description 
Definitions 

5 Unless stated otherwise, the following terms and phrases have the meanings 

provided below: 

siRNA 

The term "siRNA" refers to small inhibitory RNA duplexes that induce the 
10 RNA interference (RNAi) pathway. These molecules can vary in length (generally 
between 18-30 basepairS) and contain varying degrees of complementarity to their 
target mRNA in the antisense strand. Some, but not all, siRNA have unpaired 
overhanging bases on the 5' or 3' end of the sense strand and/or the antisense strand. 
The term "siRNA" includes duplexes of two separate strands, as well as single strands 
1 5 that can form hairpin structures comprising a duplex region, 

SiRNA may be divided into five (5) groups (non- functional, semi-functional, 
functional, highly functional, and hyper-functional) based on the level or degree of 
silencing that they induce in cultured cell lines. As used herein, these definitions are 

20:^ based on a set of conditions where the siRNA is transfected into said cell line at a 
concentration of lOOnM and the level of silencing is tested at a time of roughly 24 
hours after transfection, and not exceeding 72 hours after transfection. In this context, 
"non-functional siRNA" are defined as those siRNA that induce less than 50% 
(<50%) target silencing. "Semi-functional siRNA" induce 50-79% target silencing. 

25 "Functional siRNA" are molecules that induce 80-95% gene silencing. "Highly- 
functional siRNA" are molecules that induce greater than 95% gene silencing. 
"Hyperflinctional siRNA" are a special class of molecules. For purposes of this 
document, hyperfunctional siRNA are defined as those molecules that: (1) induce 
greater than 95% silencing of a specific target when they are transfected at 

30 subnanomolar concentrations (/.e., less than one nanomolar); and/or (2) induce 
functional (or better) levels of silencing for greater than 96 hours. These relative 
functionalities (though not intended to be absolutes) may be used to compare siRNAs 
to a particular target for applications such as functional genomics, target identification 
and therapeutics. 
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miRNA 

The temi "miRNA" refers to microRNA. 

5 Gene silencing 

The phrase "gene silencing" refers to a process by which the expression of a 
specific gene product is lessened or attenuated. Gene silencing can take place by a 
variety of pathways. Unless specified otherwise, as used herin, gene silencing refers 
to decreases in gene product expression that results fi-om RNA interference (RNAi), a 

10 defined, though partially characterized pathway whereby small inhibitory RNA 

(siRNA) act in concert with host proteins (e,g, the RNA induced silencing complex, 
RISC) to degrade messenger RNA (mRNA) in a sequence-dependent fashion. The 
level of gene silencing can be measured by a variety of means, including, but not 
limited to, measurement of transcript levels by Northem Blot Analysis, B-DNA 

15 techniques, transcription-sensitive reporter constructs, expression profiling {e.g. DNA 
chips), and related technologies. Altematively, the level of silencing can be measured 
by assessing the level of the protein encoded by a specific gene. This can be 
accomplished by performing a number of studies including Westem Analysis, 
measuring the levels of expression of a reporter protein that has e.g. fluorescent 

20 properties (e.g. GFP) or enzymatic activity (e.g. alkaline phosphatases), or several 
other procedures. 

Transfection 

The term "transfection" refers to a process by which agents are introduced into 
25 a cell. The list of agents that can be transfected is large and includes, but is not 

limited to, siRNA, sense and/or anti-sense sequences, DNA encoding one or more 
genes and organized into an expression plasmid, proteins, protein fi-agments, and 
more. There are multiple methods for transfecting agents into a cell including, but not 
limited to, electroporation, calcium phosphate-based transfections, DEAE-dextran- 
30 based transfections, lipid-based transfections, molecular conjugate-based transfections 
(e.g. polylysine-DNA conjugates), microinjection and others. 
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Target 

The term "target" is used in a variety of different forms throughout this 
document and is defined by the context in which it is used. "Target mRNA" refers to 
a messenger RNA to which a given siRNA can be directed against. "Target sequence" 
5 and "target site" refer to a sequence within the mRNA to which the sense strand of an 
siRNA shows varying degrees of homology and the antisense strand exhibits varying 
degrees of complementarity. The term "siRNA target" can refer to the gene, mRNA, 
or protein against which an siRNA is directed. Similarly "target silencing" can refer 
to the state of a gene, or the corresponding mRNA or protein. 

10 

OfiF-target silencing and Off-target interference 

The phrases "off-target silencing" and "off-target interference" are defined as 
degradation of mRNA other than the intended target mRNA due to overlapping and/or 
partial homology with secondary mRNA messages. 

15 

SMARTscore™ 

The term "SMARTscore™" refers to a number determined by applying any of 
the Formulas I - Formula IX to a given siRNA sequence. The term "SMART- 
selected" or "rationally selected" or "rational selection" refers to siRNA that have 
20 been selected on the basis of their SMARTscores™. 

Complementary 

The term "complementary" refers to^therability of polynucleotides to form 
base pairs with one another. Base pairs are typically formed by hydrogen bonds 

25 between nucleotide units in antiparallel polynucleotide strands. Complementary 

polynucleotide strands can base pair in the Watson-Crick manner (e.g,, A to T, A to 
U, C to G), or in any other manner that allows for the formation of duplexes. As 
persons skilled in the art are aware, when using RNA as opposed to DNA, uracil 
rather than thymine is the base that is considered to be complementary to adenosine. 

30 However; when a U is denoted in the context of the present invention, the ability to 
substitute a T is implied, unless otherwise stated. 

Perfect complementarity or 100% complementarity refers to the situation in 
which each nucleotide unit of one polynucleotide strand can hydrogen bond with a 
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nucleotide unit of a second polynucleotide strand. Less than perfect complementarity 
* refers to the situation in which some, but not all, nucleotide units of two strsinds can 
hydrogen bond with each other. For example, for two 20-mers, if only two base pairs 
on each strand can hydrogen bond with each other, the polynucleotide strands exhibit 

5 10% complementarity. In the same example, if 1 8 base pairs on each strand c£in 
hydrogen bond with each other, the polynucleotide strands exhibit 90% 
complementarity. "Substantial complementarity" refers to polynucleotide strands 
exhibiting 79% or greater complementarity, excluding regions of the polynucleotide 
strands, such as overhangs, that are selected so as to be noncomplementary. 

10 ("Substantial similarity" refers to polynucleotide strands exhibiting 79% or greater 

similarity, excluding regions of the polynucleotide strands, such as overhangs, that are 
selected so as not to be similar.) Thusf for example, two polynucleotides of 29 
nucleotide units each, wherein each comprises a di-dT at the 3' terminus such that the 
duplex region spans 27 bases, and wherein 26 of the 27 bases of the duplex region on 

15 each strand are complementary, are substantially complementary since they are 96.3% 
complementary when excluding the di-dT overhangs. 

Deoxvnucleotide 

The term "deoxynucleptide" refers to a nucleotide or polynucleotide lacking a 
20 hydroxyl group (OH group) at the 2' and/or 3' position of a sugar moiety. Instead, it 
has a hydrogen bonded to the 2' and/or 3' carbon. Within an RNA molecule that 
comprises one or more deoxynucleotides, "deoxynucleotide" refers to the lack of an 
OH group at the 2' position of the sugar moietyv=having.instead a hydrogen bonded 
directly to the 2" carbon. 

25 

Deoxvribonucleotide 

The terms "deoxyribonucleotide" and "DNA" refer to a nucleotide or 
polynucleotide comprising at least one sugar moiety that has an H, rather than an OH, 
at its 2' and/or 3 'position. 

30 

Substantiallv Similar 

The phrase "substantially similar" refers to a similarity of at least 90% with 
respect to the identity of the bases of the sequence. 
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Duplex Region 

The phrase "duplex region" refers to the region in two complementary or 
substantially complementary polynucleotides that form base pairs with one another, 
either by Watson-Crick base pairing or any other manner that allows for a stabilized 
duplex between polynucleotide strands that are complementary or substantially 
complementary. For example, a polynucleotide strand having 21 nucleotide units can 
base pair with another polynucleotide of 21 nucleotide units, yet only 19 bases on 
each strand are complementary or substantially complementary, such that the "duplex 
region" has 19 base pairs. The remaining bases may, for example, exist as 5' and 3' 
overhangs. Further, within the duplex region, 100% complementarity is not required; 
substantial complementarity is allowable within a duplex region. Substantial 
complementarity refers to 79% or greater complementarity. For example, a mismatch 
in a duplex region consisting of 19 base pairs results in 94.7% complementarity, 
rendering the duplex region substantially complementary. 

Nucleotide 

The term "nucleotide" refers to a ribonucleotide or a deoxyribonucleotide or 
modified form thereof, as well as an analog thereof. Nucleotides include species that 
comprise purines, e,g., adenine, hypoxanthine, guanine, and their derivatives and 
analogs, as well as pyrimidines, e.g., cytosine, uracil, thymine, and their derivatives 
and analogs. 

'Nucleotide analogs include nucleotides haxdngrmodifications in the chemical 
structure of the base, sugar and/or phosphate, including, but not limited to, 5-position 
pyrimidine modifications, 8-position purine modifications, modifications at cytosine 
exocyclic amines, and substitution of 5-bromo-uracil; and 2 '-position sugar 
modifications, including but not limited to, sugar-modified ribonucleotides in which 
the 2' -OH is replaced by a group such as an H, OR, R, halo, SH, SR, NH2, NHR, 
NR2, or CN, wherein R is an alkyl moiety. Nucleotide analogs are also meant to 
include nucleotides with bases such as inosine, queuosine, xanthine, sugars such as 
2 '-methyl ribose, non-natural phosphodiester linkages such as methylphosphonates, 
phosphorothioates and peptides. 



Modified bases refer to nucleotide bases such as, for example, adenine, 
guanine, cj^osine, thymine, uracil, xanthine, inosine, and queuosine that have been 
modified by the replacement or addition of one or more atoms or groups. Some 
• examples of types of modifications that can comprise nucleotides that are modified 
5 with respect to the base moieties include but are not limited to, alkylated, halogenated, 
thiolated, aminated, amidated, or acetylated bases, individually or in combination. 
More specific examples include, for example, 5-propynyluridine, 5-propynylcytidine, 
6-methyladenine, 6-methylguanine, N,N,-dimethyladenine, 2-propyladenine, 2- 
propylguanine, 2-aminoadenine, 1 -methylinosine, 3-methyluridine, 5-methylcytidine, 

10 5-methyluridine and other nucleotides having a modification at the 5 position, 5-(2- 
amino)propyl uridine, 5-halocytidine, 5-halouridine, 4-acetylcytidine, 1- 
methyladenosine, 2-methyladenosine, 3-methylcytidine, 6-methyluridine, 2- 
methylguanosine, 7-methylguanosine, 2,2-dimethylguanosine, 5- 
methylaminoethyluridine, 5-methyloxyuridine, deazanucleotides such as 7-deaza- 

15 adenosine, 6-azouridine, 6-azocytidine, 6-azothymidine, 5-methyl-2-thiouridine, other 
thio bases such as 2-thiouridine and 4-thiouridine and 2-thiocytidine, dihydrouridine, 
pseudouridine, queuosine, archaeosine, naphthyl and substituted naphthyl groups, any 
O- and N-alkylated purines and pyrimidines such as N6-methyladenosine, 5- 
methylcarbonylmethyluridine, uridine 5-oxyacetic acid, pyridine-4-one, pyridine-2- 

20 one, phenyl andVmodified phenyl groups such as aminophenol or 2,4,6-trimethoxy 
benzene, modified cytosines that act as G-clamp nucleotides, 8-substituted adenines 
and guanines, 5-substituted uracils and thymines, azapyrimidines, 
carboxyhydroxyalkyl nucleotides, carboxyalkylaminoalkyhnudeotides, and 
alkylcarbonylalkylated nucleotides. Modified nucleotides also include those 

25 nucleotides that are modified with respect to the sugar moiety, as well as nucleotides 
having sugars or analogs thereof that are not ribosyl. For example, the sugar moieties 
may be, or be based on, mannoses, arabinoses, glucopyranoses, galactopyranoses, 4'- 
thioribose, and other sugars, heterocycles, or carbocycles. 

30 The term nucleotide is also meant to include what are known in the art as 

universal bases. By way of example, universal bases include but are not limited to 3- 
nitropyrrole, 5-nitroindole, or nebularine. The term "nucleotide" is also meant to 
include the N3' to P5' phosphoramidate, resulting fi-om the substitution of a ribosyl 3' 
oxygen with an amine group. 
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Further, the term nucleotide also includes those species that have a detectable 
label, such as for example a radioactive or fluorescent moiety, or mass label attached 
to the nucleotide. 

5 

Polynucleotide 

The term "polynucleotide" refers to polymers of nucleotides, and includes but 
is not limited to DNA, RNA, DNA/RNA hybrids including polynucleotide chains of 
regularly and/or irregularly alternating deoxyribosyl moieties and ribosyl moieties 
10 (i.e., wherein alternate nucleotide units have an -OH, then and -H, then an -OH, then 
^ an — H, and so on at the 2' position of a sugar moiety), and modifications of these 
kinds of polynucleotides, wherein the attachment of various entities or moieties to the 
nucleotide units at any position are included. 



15 Polyribonucleotide 

The term "polyribonucleotide" refers to a polynucleotide comprising two or 
more modified or unmodified ribonucleotides and/or their analogs. The term 
"polyribonucleotide" is used interchangeably with the term "oligoribonucleotide." 

20 Ribonucleotide andrribonucleic acid 

The term "ribonucleotide" and the phrase "ribonucleic acid" (RNA), refer to a 
modified or unmodified nucleotide or polynucleotide comprising at least one 
ribonucleotide unit. A ribonucleotide imit comprises an hydBDXkyJtr^roup attached to 
the 2' position of a ribosyl moiety that has a nitrogenous base attached in N- 

25 glycosidic linkage at the 1 ' position of a ribosyl moiety, and a moiety that either 
allows for linkage to another nucleotide or precludes linkage. 



DetaUed Description of the Invention 

30 The present invention is directed to improving the efficiency of gene silencing 

by siRNA. Through the inclusion of multiple siRNA sequences that are targeted to a 
particular gene and/or selecting an siRNA sequence based on certain defined criteria, 
improved efficiency may be achieved. 
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The present invention will now be described in connection with preferred 
embodiments. These embodiments are presented in order to aid in an understanding 
of the present invention and are not intended, and should not be construed, to limit the 
invention in any way. All alternatives, modifications and equivalents that may 
5 become apparent to those of ordinary skill upon reading this disclosure are included 
within the spirit and scope of the present invention. 

Furthermore, this disclosure is not a primer on RNA interference. Basic 
concepts known to persons skilled in the art have not been set forth in detail. 

10 

Optimizing siRNA 

According to one embodiment, the present invention provides a method for 
improving the effectiveness of gene silencing for use to silence a particular gene 
through the selection of an optimal siRNA. An siRNA selected according to this 
15 method may be used individually, or in conjunction with the first embodiment, i.e., 
with one or more other siRNAs, each of which may or may not be selected by this 
criteria in order to maximize their efficiency. 

The degree to which it is possible to select an siRNA for a given mRNA that 
20 maximizes these criteria -will depend on the sequence of the mRNA itself. However, 
the selection criteria will be independent of the target sequence. According to this 
method, an siRNA is selected for a given gene by using a rational design. That said, 
rational design can be described in a variety of ways. Rational-desigsuis,. in simplest 
terms, the application of a proven set of criteria that enhance the probability of 
25 identifying a functional or hyperfunctional siRNA. In one method, rationally 
designed siRNA can be identified by maximizing one or more of the following 
criteria: 



1 . A low GC content, preferably between about 30 -52%. 
30 2. At least 2, preferably at least 3 A or U bases at positions 1 5- 19 of the 

siRNA on the sense strand. 

3. An A base at position 19 of the sense strand. 

4. An A base at position 3 of the sense strand. 

5. A U base at position 10 of the sense strand. 
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6. An A base at position 14 of the sense strand. 

7. A base other than C at position 1 9 of the sense strand. 

8. A base other than G at position 13 of the sense strand. 

9. A Tm, which refers to the character of the internal repeat that results in 
5 inter- or intramolecular structures for one strand of the duplex, that is 

preferably not stable at greater than 50°C, more preferably not stable at 
greater than ST^'C, even more preferably not stable at greater than 30°C 
and most preferably not stable at greater than 20*^0. 

10. A base other than U at position 5 of the sense strand. 
10 11. A base other th£tn A at position 1 1 of the sense strand. 

Criteria 5, 6, 10 and 1 1 are minor criteria, but are nonetheless desirable. 
Accordingly, preferably an siRNA will satisfy as many of the aforementioned criteria 
as possible, more preferably at least 1-4 and 7-9, and most preferably all of the 
15 criteria 

With respect to the criteria, GC content, as well as a high number of AU in 
positions 15-19, may be important for easement of the unwinding of double stranded 
siRNA duplex. Duplex imwinding has been shown to be crucial for siRNA 
20 functionality m v/vo. 

With respect to criterion 9, the intemal structure is measured in terms of the 
melting temperature of the single strand of siRNA, which is the tem^Katuce at which 
50% of the molecules will become denatured. With respect to criteria 2-8 and 10 - 
25 11, the positions refer to sequence positions on the sense strand, which is the strand 
that is identical to the mRNA. 

In one preferred embodiment, at least criteria 1 and 8 are satisfied. In another 
preferred embodiment, at least criteria 7 and 8 are satisfied. In still another preferred 
30 embodiment, at least criteria 1, 8 and 9 are satisfied. 

It should be noted that all of the aforementioned criteria regarding sequence 
position specifics are with respect to the 5' end of the sense strand. Reference is 
made to the sense strand, because most databases contain information that describes 
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the information of the mRNA. Because according to the present invention a chain can 
be from 18 to 30 bases in length, and the aforementioned criteria assumes a chain 19 
base pairs in length, it is important to keep the aforementioned criteria applicable to 
the correct bases. 

5 

When there are only 1 8 bases, the base pair that is not present is the base pair 
that is located at the 3' of the sense strand. When there are twenty to thirty bases 
present, then additional bases are added at the 5' end of die sense chain and occupy 
positions ~1 to ~1 1 . Accordingly, with respect to SEQ. ID NO. 0001 . 
1 0 NNANANNNNUCNAANNNNA and SEQ. ID NO. 0028. 

GUCNNANANNNNUCNAANNNNA, both would have A at position 3, A at 
position 5, U at position 10, C at position 11, A and position 13, A and position 14 
and A at position 19. However, SEQ. ID NO. 0028 would also have C at position -1, 
U at position -2 and G at position —3. 

15 

For a 19 base pair siRNA, an optimal sequence of one of the strands may be 
represented below, where N is any base. A, C, G, or U: 



SEQ. ID NO. 0001. NNANANNNNUCNAANNNNA 
20 SEQ. ID NO. 0002. NNANANNNNUCNAANNNNA 
SEQ. ID NO. 0003. NNANANNNNUUNAANNNNA 
SEQ. ID NO. 0004. NNANANNNNUCNCANNNNA 
SEQ. ID NO. 0005. NNANANNNNUGNCANNNNA 
SEQ. ID NO. 0006. NNANANNNNUUNCANNNNA 
25 SEQ. ID NO. 0007. NNANANNNNUCNUANNNNA 
SEQ. ID NO. 0008.. NNANANNNNUCNUANNNNA 
SEQ. ID NO. 0009. NNANANNNNUUNUANNNNA 
SEQ. ID NO. 0010. NNANCNNNNUCNAANNNNA 
SEQ. ID NO. 001 1. NNANCNNNNUCNAANNNNA 
30 SEQ. ID NO. 001 2. NNANCNNNNUUNAANNNNA 
SEQ. ID NO. 0013. NNANCNNNNUCNCANNNNA 
SEQ. ID NO. 0014. NNANCNNNNUCNCANNNNA 
SEQ. ID NO. 0015. NNANCNNNNUUNCANNNNA 
SEQ. ID NO. 0016. NANCNNNNUCNUANNNNA 
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SEQ. ID NO. 0017. NNANCNNNNUGNUANNNNA 
SEQ. ID NO. 0018. NNANCNNNNUUNUANNNNA 
SEQ. ID NO. 0019. NNANGNNNNUCNAANNNNA 
SEQ. ID NO. 0020. NNANGNNNNUCNAANNNNA 
5 SEQ. ID NO. 002 1 . NNANGNNNNUUNAANNNNA 
SEQ. ID NO. 0022. NNANGNNNNUCNCANNNNA 
SEQ. ID NO. 0023. NNANGNNNNUGNCANNNNA 
SEQ. ID NO. 0024. NNANGNNNNUUNCANNNNA 
SEQ. ID NO. 0025. NNANGNNNNUCNUANNNNA 
10 SEQ. ID NO. 0026. NNANGNNNNUGNUANNNNA 
SEQ. ID NO. 0027. NNANGNNNNNUNUANNNNA 

In one embodiment, the sequence used as an siRNA is selected by choosing 
the siRNA that score highest according to one of the following seven algorithms that 
15 are represented by Formulas I - VII: 

Formula I 

Relative functionality of siRNA= -(GC/3) +(AUi5.i9) -(Tm2o°c)*3 -(Gi3)*3 -(C19) 
+(A,9)*2 +(A3) +(U,o)+(A,4) -(Us) -(All) 

20 

Formula II 

Relative fVmctionality of siRNA= -(GC/3) -(AUi5.i9)*3 -(Gi3)*3 -(C19) +(Ai9)*2 

*«»w*=s^»i?'J»^ "'■(A3) ' *w«*^B«>»»^ 

25 Formula III 

Relative functionality of siRNA= -(GC/3) +(AUi5.i9) -(Tm2o<>c)*3 

Formula FV 

Relative functionality of siRN A= 
30 -GC/2+( AU,5-i9)/2-(Tm2(rc)*2 -(G,3)*3 -(C,,) +(A,9)*2 -KA3) +(U,o)+(Ai4) -(Us) - 
(An) 



Formula V 
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Relative functionality of siRNA=-(Gi3)*3 -(C19) +(Ai9)*2 +(A3) + (Uio)+(Ai4) -(IJs) 
-(An) 



Formula VI 

Relative functionality of siRNA=-(Gi3)*3 -(C19) +(Ai9)*2 +(A3) 
Formula VII 

Relative functionality of siRNA=-(GC/2) +(AUi5^i9)/2 -( Tm2o-c)*l -(Gi3)*3 -(C19) 
+(Ai9)*3 +(A3)*3 +(U,o)/2+(Ai4)/2 -(U5)/2 -(Ai ,)/2 

In Formulas I — VII: 

wherein A19 = 1 if A is the base at position 19 on the sense strand, otherwise its value 
is 0, 

AU15.19 = 0 — 5 depending on the number of A or U bases on the sense strand 

at 

positions 15-19; 

Gi3 = 1 if G is the base at position 13 on the sense strand, otherwise its value is 

0; 

Ci9 = 1 if C is the base at position 19 of the sense strand, otherwise its value is 
0; .-v 

GC= the number of G and C bases in the entire sense strand; 
Tm 20°c=l if the Tm is greater than 20''C; 

A3= 1 if A is the base at position 3 on the sense strand, otherwise its value is 0; 
Uio= 1 if U is the base at position 10 on the sense strand, otherwise its value is 

0; 

A 14 = 1 if A is the base at position 14 on the sense strand, otherwise its value 

is 0; 

U5 = 1 if U is the base at position 5 on the sense strand, otherwise its value is 

0; and 

All = 1 if A is the base at position 1 lof the sense strand, otherwise its value is 

0. 

Formulas I -VII provide relative information regarding functionality. When 
the values for two sequences are compared for a given formula, the relative 
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functionality is ascertained; a higher positive number indicates a greater fiinctionaHty. 
For example, in many applications a value of 5 or greater is beneficial. 

Additionally, in many applications, more than one of these formulas would 
5 provide useful information as to the relative functionality of potential siRNA 

sequences. However, it is beneficial to have more than one type of formula, because 
not every formula will be able to help to differentiate among potential siRNA 
sequences. For example, in particularly high GC mRNAs, formulas that take that 
parameter into account would not be useful and application of formulas that lack GC 

10 elements {e.g., formulas V and VI) might provide greater insights into duplex 
functionality. Similarly, formula II might by used in situations where hairpin 
structures are not observed in duplexes, and formula IV might be applicable for 
sequences that have higher AU content. Thus, one may consider a particular sequence 
in light of more than one or even all of these algorithms to obtain the best 

1 5 differentiation among sequences. In some instances, application of a given algorithim 
may identify an unususally large number of potential siRNA sequences, and in those 
cases, it may be appropriate to re-analyze that sequence with a second algorithm that 
is, for instance, more stringent. Alternatively, it is conceivable that analysis of a 
sequence with a given formula yields no acceptable siRNA sequences (i.e. low 

20 SMARTscores*^^). In this instance, it may^be appropriate to re-analyze that sequences 
with a second algorithm that is, for instance, less stringent. In still other instances, 
analysis of a single sequence with two separate formulas may give rise to conflicting 
,:^t,.**-. T> i-.^ results (i.e. one formula generates a set of siRNA with high SMARTscores™ while^^. .^-t - 
the other formula identifies a set of siRNA with low SMARTscores^^). In these 

25 instances, it may be necessary to determine which weighted factor(s) (e.g. GC 

content) are contributing to the discrepancy and assessing the sequence to decide 
whether these factors should or should not be included. Altematively, the sequence 
could be analyzed by a third, fovuth, or fifth algorithm to identify a set of rationally 
designed siRNA. 

30 

The above-referenced criteria are particularly advantageous when used in 
combination with pooling techniques as depicted in Table I: 
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Table I 



Criteria 


Functional Probability 


Oligos 


Pools 


>95% 


>80% 


<70% 


>95% 


>80% 


<70% 


Current 


33.0 


50.0 


23.0 


79.5 


97.3 


0.3 


New 


50.0 


88.5 


8.0 


93.8 


99.98 


0.005 


(GC) 


28.0 


58.9 


36.0 


72.8 


97.1 


1.6 



The term "current" refers to Tuschl's conventional siRNA parameters (Elbashir, S.M. 
et ah (2002) "Analysis of gene function in somatic mammalian cells using small 
5 interfering RNAs" Methods 26: 199-213). "New" refers to the design parameters 
described in Formulas I-VIL "GC" refers to criteria that select siRNA solely on the 
basis of GC content. 

As Table I indicates, when more functional siRNA duplexes are chosen, 
10 siRNAs that produce <70% silencing drops from 23% to 8% and the number of 

siRNA duplexes that produce >80% silencing rises from 50% to 88.5%. Further, of 
the siRNA duplexes with >80% silencing, a larger portion of these siRNAs actually 
silence >95% of the target expression (the new criteria increases the portion from 
33% to 50%). Using this new criteria in pooled^^iRNAs, shows that, with pooling, the 
15 amount of silencing >95% increases from 79.5% to 93.8% and essentially eliminates 
any. siRNA pool from silencing less than 70%. 

Table II similarly shows the particuleirly beneficial results of pooling in 
combination with the aforementioned criteria. However, Table II, which takes into 
20 account each of the aforementioned variables, demonstrates even a greater degree of 
improvement in functionality. 

Table II 





Functional Probability 


Oligos 


Pools 


Functional 


Average 


Non- 
functional 


Functional 


Average 


Non- 
functional 


Random 


20 


40 


50 


67 


97 


3 
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Criteria 1 


52 


99 


0.1 


97 


93 


0.0040 


Criteria 4 


89 


99 


0.1 


99 


99 


0.0000 



The terms "functional," "Average," and "Non- functional" refer to siRNA that exhibit 
>80%, >50%, and <50% functionality, respectively. Criteria 1 and 4 refer to specific 
criteria described above. 

5 

The above-described algorithms may be used with or without a computer 
program that allows for the inputting of the sequence of the mRNA and automatically 
outputs the optimal siRNA. The computer program may, for example, be accessible 
from a local terminal or personal computer, over an internal network or over the 
10 Internet. 



In addition to the formulas above, more detailed algorithms may be used for 
selecting siRNA. Preferably, at least one RNA duplex of between 18 and 30 base 
pairs is selected such that it is optimized according a formula selected from: 

15 



Formula VIII: (-14)*Gi3-13*A,-12*U7-ll*U2-10*Aii-10*U4-10*C3-10*C5-10*C6- 
9*Aio-9*U9-9*C,8-8*G,o-7*Ui-7*^lIIi6-7*Ci7-7*Ci9 
-h7*U,7+8*A2+8*A4 +8*A5+8*C4 +9*G8 +10*A7+1 0*Ui8+l 1 *Ai9+ 
20 ll*C9+15*Gi+ 18*A3+19*Uio-Tm-3* (GCtotai) ~ 6*(GCi5-i9)- 

— • 30*X; and 



Formula IX: (14.1)*A3+(14.9)*A6+(17.6)*Ai3+(24.7)*Ai9+(14.2)*Uio+(10.5)* 
C9+(23.9)*Gi+(16.3)*G2+(-12.3)*Aii-h(-19.3)*Ui+(-12.1)*U2+ 
25 (-ll)*U3+(-15.2)*U,5+(-11.3)*Ui6+(-11.8)*C3+(-17.4)*C6+{- 

10.5)*C7+ (-13.7)*Gi3+(-25.9)*Gi9-Tm-3* (GCtotai) - 6*(GCi5-i9)- 
30*X 

wherein 

30 Ai = 1 if A is the base at position 1 of the sense strand, otherwise its value is 0; 
A2 = 1 if A is the base at position 2 of the sense strand, otherwise its value is 0; 
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A3 = 1 if A is the base at position 3 of the sense strand, otherwise its value is 0; 
A4 = 1 if A is the base at position 4 of the sense strand, otherwise its value is 0; 
A5 = 1 if A is the base at position 5 of the sense strand, otherwise its value is 0; 
A6 = 1 if A is the base at position 6 of the sense strand, otherwise its value is 0; 
5 A7 = 1 if A is the base at position 7 of the sense strand, otherwise its value is 0; 
Aio= 1 if A is the base at position 10 of the sense strand, otherwise its value is 0; 
Ai I = 1 if A is the base at position 1 1 of the sense strand, otherwise its value is 0; 
A|3 = 1 if A is the base at position 13 of the sense strand, otherwise its value is 0; 
A19 = 1 if A is the base at position 19 of the sense strand, otherwise if another base is 
10 present or the sense strand is only 1 8 base pairs in length, its value is 0; 

C3 = 1 if C is the base at position 3 of the sense strand, otherwise its value is 0; 

C4 = 1 if C is the base at position 4 of the sense strand, otherwise its value is 0; 

C5 = 1 if C is the base at position 5 of the sense strand, otherwise its value is 0; 
15 C6 = 1 if C is the base at position 6 of the sense strand, otherwise its value is 0; 

C7 = 1 if C is the base at position 7 of the sense strand, otherwise its value is 0; 

C9 = 1 if C is the base at position 9 of the sense strand, otherwise its value is 0; 

Ci7 = 1 if C is the base at position 17 of the sense strand, otherwise its value is 0; 

C18 = 1 if C is the base at position 18 of the sense strand, otherwise its value is 0; 
20 Ci9 = 1 if C is the base at position 19 of the sense strand, otherwise if another base is 
present or the sense strand is only 1 8 base pairs in length, its value is 0; 

Gi = 1 if G is the base at position 1 on the sense strand, otherwise its value is 0; 
G2 = 1 if G is the base at position 2 of the sense strand, otherwise its value is 0; 
25 Gs = 1 if G is the base at position 8 on the sense strand, otherwise its value is 0; 

Gio= 1 if G is the base at position 10 on the sense strand, otherwise its value is 0; - 
Gi3 = 1 if G is the base at position 13 on the sense strand, otherwise its value is 0; 
Gi9= 1 if G is the base at position 19 of the sense strand, otherwise if another base is 
present or the sense strand is only 1 8 base pairs in length, its value is 0; 
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Ui = 1 if U is the base at position 1 on the sense strand, otherwise its value is 0; 
U2 = 1 if U is the base at position 2 on the sense strand, otherwise its value is 0; 
U3 = 1 if U is the base at position 3 on the sense strand, otherwise its value is 0; 
U4 = 1 if U is the base at position 4 on the sense strand, otherwise its value is 0; 
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Uy = 1 if U is the base at position 7 on the sense strand, otherwise its value is 0; 
U9 = 1 if U is the base at position 9 on the sense strand, otherwise its value is 0; 
Uio =1 if U is the base at position 10 on the sense strand, otherwise its value is 0; 
Ui5 = 1 if U is the base at position 1 5 on the sense strand, otherwise its value is 0; 
5 U16 = 1 if U is the base at position 16 on the sense strand, otherwise its value is 0; 
Ui7 = 1 if U is the base at position 17 on the sense strand, otherwise its value is 0; 
U18 = 1 if U is the base at position 1 8 on the sense strand, otherwise its value is 0; 

GC15-19 = the number of G and C bases within positions 15 - 19 of the sense 
1 0 strand, or within positions 1 5 -1 8 if the sense strEind is only 1 8 base pairs in 

length; 

GCtotai = the number of G and C bases in the sense strand; 
Tm = 100 if the siRNA oUgo has the internal repeat longer then 4 base pairs, 
otherwise its value is 0; and 
.15 X = the number of times that the same nucleotide repeats four or more times in a 
row. 

The above formulas VIII and IX, as well as formulas I - VII, provide methods 
for selecting siRNA in order to incresise the efficiency of gene silencing. A subset of 
20 variables of any of the formulas may be used, though when-^ewer variables are used, 
the optimization hierarchy becomes less reliable. 

. » With respect to the variables of the above-referenced formuleis, a single letter .vi 
of A or C or G or U followed by a subscript refers to a binary condition. The binary 

25 condition is that either the particular base is present at that particular position 
(wherein the value is "1") or the base is not present (wherein the value is "0")- 
Because position 19 is optional, i.e. there might be only 18 base pairs, when there are 
only 1 8 base pairs, any base with a subscript of 19 in the formulas above would have 
a zero value for that parameter. Before or after each variable is a number followed by 

30 *, which indicates that the value of the variable is to be multiplied or weighed by that 
number. 

The numbers preceding the variables A, or G, or C, or U in Formulas VIII and 
IX (or after the variables in Formula I - VII) were determined by comparing the 
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difference in the frequency of individual bases at different positions in functional 
siRNA and total siRNA. Specifically, the frequency in which a given base was 
observed at a particular position in functional groups was compared with the 
frequency that that same base was observed in the total, randomly selected siRNA set. 
If the absolute value of the difference between the functional and total values was 
foimd to be greater than 6%, that parameter was included in the equation. Thus for 
instance, if the frequency of finding a "G" at position 13 (G 13) is found to be 6% in a 
given functional group, and the frequency of Gn in the total population of siRNAs is 
20%, the difference between the two values is 6%-20% = -14%. As the absolute value 
is greater than six (6), this factor (-14) is included in the equation. Thus in Formula 
VIII, in cases where the siRNA under study has a G in position 13, the accrued value 
is (-14) * (1) = -14. In contrast, when a base other than G is found at position 13, the 
accrued value is (-14) * (0) = 0. 

When developing a means to optimize siRNAs, the inventors observed that a 
bias toward low internal thermodynamic stability of the duplex at the 5'-antisense 
(AS) end is characteristic of naturally occurring miRNA precursors. The inventors 
extended this observation to siRNAs for which functionality had been assessed in 
tissue culture. 

With respect to the parameter GC15-19, a value of 0 - 5 will be ascribed 
depending on the number of G or C bases at positions 15 to 19. If there are only 18 
base pairs^ thcvalue is between 0 and 4. 

With respect to the criterion GCtotai content, a number from 0-30 will be 
ascribed, which correlates to the total number of G and C nucleotides on the sense 
strand, excluding overhangs. Without wishing to be bound by any one theory, it is 
postulated that the significance of the GC content (as well as AU content at positions 
15-19, which is a parameter for formulas III - VII) relates to the easement of the 
imwinding of a double-stranded siRNA duplex. Duplex unwinding is believed to be 
crucial for siRNA functionality in vivo and overall low internal stability, especially 
low internal stability of the first unwound base pair is believed to be important to 
maintain sufficient processivity of RISC complex-induced duplex unwinding. If the 
duplex has 19 base pairs, those at positions 15-19 on the sense strand will unwind first 
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if the molecule exhibits a sufficiently low internal stability at that position. As 
persons skilled in the art are aware, RISC is a complex of approximately twelve 
proteins; Dicer is one, but not the only, helicase within this complex. Accordingly, 
although the GC parameters are believed to relate to activity with Dicer, they are also 
5 important for activity with other RISC proteins. 



The value of the parameter Tm is 0 when there are no internal repeats longer 
than (or equal to) four base pairs present in the siRNA duplex; otherwise the value is 
1 . Thus for example, if the sequence ACGUACGU, or any other four nucleotide (or 
10 more) palindrome exists within the structure, the value will be one (1). Alternatively 
if the structure ACGGACG, or any other 3 nucleotide (or less) palindrome exists, the 
value will be zero (0). 



The variable "X" refers to the number of times that the same nucleotide occurs 
1 5 contiguously in a stretch of four or more units. If there are, for example, four 
contiguous As in one part of the sequence and elsewhere in the sequence four 
contiguous Cs, X =2. Further, if there are two separate contiguous stretches of four of 
the same nucleotides or eight or more of the same nucleotides in a row, then X=2. 
However, X does not increase for five, six or seven contiguous nucleotides. 

20 

Again, when applying Formula VIII or Formula IX to a given mRNA, (the 
"target RNA" or "target molecule"), one may use a computer program to evaluate the 
criteria for every^sequence of 18 - 30 base pairs or only sequences of a fixed length, 
eg., 19 base pairs. Preferably the computer program is designed such that it provides 

25 a report ranking of all of the potential siRNAs between 1 8 and 30 base pairs, ranked 
according to which sequences generate the highest value. A higher value refers to a 
more efficient siRNA for a particular target gene. The computer program that may be 
used, may be developed in any computer language that is known to be useful for 
scoring nucleotide sequences, or it may be developed with the assistemce of 

30 commercially available product such as Microsoft's product .net. Additionally, rather 
than run every sequence through one and/or another formula, one may compare a 
subset of the sequences, which may be desirable if for example only a subset are 
available. For instance, it may be desirable to first perform a BLAST (Basic Local 
Alignment Search Tool) search and to identify sequences that have no homology to 
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other targets. Alternatively, it may be desirable to scan the sequence and to identify 
regions of moderate GC context, then perform relevant calculations using one of the 
above-described formulas on these regions. These calculations can be done manually 
or with the aid of a computer. 

5 

As with Formulas I - VII, either Formula VIII or Formula IX may be used for 
a given mRNA target sequence. However, it is possible that according to one or the 
other formula more than one siRNA will have the same value. Accordingly, it is 
beneficial to have a second formula by which to differentiate sequences. Formula IX 

10 was derived in a similar fashion as Formula VIII, yet used a larger data set and thus 

yields sequences with higher statistical correlations to highly functional duplexes. The 
sequence that has the highest value ascribed to it may be referred to as a "first 
optimized duplex." The sequence that has the second highest value ascribed to it may 
be referred to as a "second optimized duplex." Similarly, the sequences that have the 

1 5 third and fourth highest values ascribed to them may be referred to as a third 

optimized duplex and a fovirth optimized duplex, respectively. When more than one 
sequence has the same value, each of them may, for example, be referred to as first 
optimized duplex sequences or co-first optimized duplexes. 

20 SiRNA sequences identified using Formula VIII are contained-within the 

enclosed compact disks. The data included on the enclosed compact disks is 
described more fully below. The sequences identified by Formula VIII that are 
disclosed in the^compacts^disks may be used in gene silencing applications. 

25 It should be noted that for Formulas VIII and IX all of the aforementioned 

criteria are identified as positions on the sense strand when oriented in the 5' to 3' 
direction as they are identified in connection with Formulas I - VII unless otherwise 
specified. 

30 Formulas I - IX, may be used to select or to evaluate one, or more than one, 

siRNA in order to optimize silencing. Preferably, at least two optimized siRNAs that 
have been selected according to at least one of these formulas are used to silence a 
gene, more preferably at least three and most preferably at least four. The siRNAs 
may be used individually or together in a pool or kit. Further, they may be applied to 
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a cell simultaneously or separately. Preferably, the at least two siRNAs are applied 
simultaneously. Pools are particularly beneficial for many research applications. 
However, for therapeutics, it may be more desirable to employ a single 
hyperfimctional siRNA as described elsewhere in this application. 

5 

When planning to conduct gene silencing, and it is necessary to choose 
between two or more siRNAs, one should do so by comparing the relative values 
when the siRNA are subjected to one of the formulas above. In general a higher 
scored siRNA should be used. 

10 

Useful applications include, but are not limited to, target validation, gene 
functional analysis, research and drug discovery, gene therapy and therapeutics. 
Methods for using siRNA in these applications are well known to persons of skill in 
the art. 

15 

Because the ability of siRNA to function is dependent on the sequence of the 
RNA and not the species into which it is introduced, the present invention is 
applicable across a broad range of species, including but not limited to all mammalian 
species, such as humans, dogs, horses, cats, cows, mice, hamsters, chimpanzees and 
20 gorillas, as well as other species and organisms such as bacteria, viruses, insects, 
plants and C. elegans. 



The presentinj^entioiiLis also applicable for use for silencing a broad range of 
genes, including but not limited to the roughly 45,000 genes of a human genome, and 
25 has particular relevance in cases where those genes are associated with diseases such 
as diabetes, Alzheimer's, cancer, as well as all genes in the genomes of the 
aforementioned organisms. 



The siRNA selected according to the aforementioned criteria or one of the 
30 aforementioned algorithms are also, for example, useful in the simultaneous screening 
and functional analysis of multiple genes and gene families using high throughput 
strategies, as well £is in direct gene suppression or silencing. 

Development of the Algorithms 
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To identify siRNA sequence features that promote functionality and to 
quantify the importance of certain currently accepted conventional factors — such as 
G/C content and target site accessibility — ^the inventors synthesized an siRNA panel 
consisting of 270 siRNAs targeting three genes, Human Cyclophilin, Firefly 
5 Luciferase, and Human DBI. In all three cases, siRNAs were directed against specific 
regions of each gene. For Human Cyclophilin and Firefly Luciferase, ninety siRNAs 
were directed against a 199 bp segment of each respective mRNA. For DBI, 90 
siRNAs were directed against a smaller, 109 base pair region of the mRNA. The 
sequences to which the siRNAs were directed are provided below. 

10 

It should be noted that in certain sequences, "t" is present. This is because 
many databases contain information in this manner. However, the t denotes a uracil 
residue in mRNA and siRNA. Any algorithm will, unless otherwise specified, 
process a t in a sequence as a u. 

15 

Human cvclophilin: 193 — 390, M60857 
SEQ. ID NO. 29: 

gttccaaaaacagtggataattttgtggccttagctacaggagagaaaggatttggctacaaaaacagcaaattccatcgtgt 
aatcaaggacttcatgatccagggcggagacttcaccaggggagatggcacaggaggaaagagcatctacggtgagcg 
20 cttccccgatgagaacttcaaactgaagcactacgggcctggctggg 

Fireflv luciferase: 1434 — 1631. U47298 fpGL3. Promega^ 
SEQ. ID NO. 30: 

tgaacttcccgccgccgttgttgttttggagcacggaaagacgatgacggaaaaagagatcgtggattacgtcgccagtca 
25 agtaacaaccgcgaaaaagttgcgcggaggagttgtgtttgtggacgaagtaccgaaaggtcttaccggaaaactcgacg 
caagaaaaatcagagagatcctcataaaggccaagaagg 

DEL NM_020548 ( 202-310) (every position) 
SEQ. ID NO. 0031: 

3 0 acgggcaaggccaagtgggatgcctggaatgagctgaaagggacttccaaggaagatgccatgaaagcttacatcaaca . 
aagtagaagagctaaagaaaaaatacggg 

A list of the siRNAs appears in Table III (see Examples Section, Example II) 
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The set of duplexes was analyzed to identify correlations between siRNA 
functionality and other biophysical or thermodynamic properties. When the siRNA 
panel was analyzed in functional and non-functional subgroups, certain nucleotides 
were much more abundant at certain positions in fimctional or non-functional groups. 
5 More specifically, the frequency of each nucleotide at each position in highly 
functional siRNA duplexes was compared with that of nonfunctional duplexes in 
order to assess the preference for or against any given nucleotide at every position. 
These analyses were used to determine important criteria to be included in the siRNA 
algorithms (Formulas VIII and IX). 

10 

The data set was also analyzed for distinguishing biophysical properties of 
siRNAs in the functional group, such as optimal percent of GC content, propensity for 
internal structures and regional thermodynamic stability. Of the presented criteria, 
several are involved in duplex recognition, RISC activation/duplex unwinding, and 
1 5 target cleavage catalysis. 

The original data set that was the source of the statistically derived criteria is 
shown in Figure 2. Additionally, this figure shows that random selection yields 
siRNA duplexes with unpredictable and widely varying silencing potencies as 

20 measured in tissue culture using HEK293 cells. In the figure, duplexes are plotted 

such that each x-axis tick-mark represents an individual siRNA, with each subsequent 
siRNA differing in target position by two nucleotides for Human Cyclophilin and 
Firefly Luciferase, and by oneTnueleotide for Human DBI. Furthermore, the y-axis 
denotes the level of target expression remaining after transfection of the duplex into 

25 cells and subsequent silencing of the target. 

SiRNA identified and optimized in this document work equally well in a wide 
range of cell types. Figure 3a shows the evaluation of thirty siRNAs targeting the 
DBI gene in three cell lines derived from different tissues. Each DBI siRNA displays 
30 very similar functionality in HEK293 (ATCC, CRL-1 573, human embryonic kidney), 
HeLa (ATCC, CCL-2, cervical epithelial adenocarcinoma) and DU145 (HTB-81, 
prostate) cells as deterimined by the B-DNA assay. Thus, siRNA functionality is 
determined by the primary sequence of the siRNA and not by the intracellular 
environment. Additionally, it should be noted that although the present invention 
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provides for a determination of the functionality of siRNA for a given target, the same 
siRNA may silence more than one gene. For example, the complementary sequence 
of the silencing siRNA may be present in more than one gene. Accordingly, in these 
circumstances, it may be desirable not to use the siRNA with highest SMARTscore™. 
5 In such circumstances, it may be desirable to use the siRNA with the next highest 
SMARTscore™. 

To determine the relevance of G/C content in siRNA function, the G/C content 
of each duplex in the panel was calculated and the functional classes of siRNAs 
10 (<F50, > F50, > F80, > F95 where F refers to the percent gene silencing) were sorted 
accordingly. The majority of the highly- functional siRNAs (>F95) fell within the 
G/C content range of 36% — 52% (Figure 3B). Twice as many non-functional (< 
F50) duplexes fell within the high G/C content groups (>57% GC content) compared 
to the 36% — 52% group. The group with extremely low GC content (26% or less) 
15 contained a higher proportion of non-functional siRNAs and no highly- functional 
siRNAs. The G/C content range of 30% — 52% was therefore selected as Criterion I 
for siRNA functionality, consistent with the observation that a G/C range 30% — 70% 
promotes efficient RNAi targeting. Application of this criterion alone provided only a 
marginal increase in the probability of selecting functional siRNAs from the panel: 
^:20 selection of F50 and F95 siRNAs was improved by 3.6% and 2.2%, respectively. The: 
siRNA panel presented here permitted a more systematic analysis and quantification 
of the importance of this criterion than that used previously. 

A relative measure of local internal stability is the A/U base pair (bp) content; 

25 therefore, the frequency of A/U bp was determined for each of the five terminal 
positions of the duplex (5' sense (S)/5' antisense (AS)) of all siRNAs in the panel. 
Duplexes were then categorized by the number of A/U bp in positions 1 — 5 and 15 — 
19 of the sense strand. The thermodynamic flexibility of the duplex 5'-end (positions 
1 — 5; S) did not appear to correlate appreciably with silencing potency, while that of 

30 the 3 '-end (positions 15 — 19; S) correlated with efficient silencing. No duplexes 

lacking A/U bp in positions 1 5 — 1 9 were functional. The presence of one A/U bp in 
this region conferred some degree of functionality, but the presence of three or more 
A/Us was preferable and therefore defined as Criterion II. When applied to the test 
panel, only a marginal increase in the probability of functional siRNA selection was 
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achieved: a 1.8% and 2.3% increase for F50 and F95 duplexes, respectively (Table 
IV). 



The complementary strands of siRNAs that contain intemal repeats or 
5 palindromes may form intemal fold-back structures. These hairpin-like structures 
exist in equilibrium with the duplexed form effectively reducing the concentration of 
functional duplexes. The propensity to form intemal hairpins and their relative 
stability can be estimated by predicted melting temperatures. High Tm reflects a 
tendency to form hairpin structures. Lower Tm values indicate a lesser tendency to 

10 form hairpins. When the functional classes of siRNAs were sorted by (Figure 3c), 
the following trends were identified: duplexes lacking stable intemal repeats were the 
most potent silencers (no F95 duplex with predicted hairpin structure > 60 ^C). In 
contrast, about 60% of the duplexes in the groups having intemal hairpins with 
calculated T;;, values less than 20 ""C were F80. Thus, the stability of intemal repeats 

15 is inversely proportional to the silencing effect and defines Criterion III (predicted 
hairpin stmcture T„ < 20 ""C). 

Sequence-based determinants of siRNA functionality 

When the siRNA panel was sorted into functional and non-functional groups, 

20.1 the firequency of a specific nucleotide at each position in a functional siRNA duplex 
was compared with that of a nonfunctional duplex in order to assess the preference for 
or against a certain nucleotide. Figure 4 shows the results of these queries and the 
subsequent resorting of the data set .(firomvEigure 2). The data is separated into two 
sets: those duplexes that meet the criteria, a specific nucleotide in a certain position - 

25 grouped on the left (Selected) and those that do not - grouped on the right 

(Eliminated). The duplexes are further sorted from most fimctional to least functional 
with the y-axis of Figure 4a-e representing the % expression Le, the amount of 
silencing that is elicited by the duplex (Note: each position on the X-axis represents a 
different duplex). Statistical analysis revealed correlations between silencing and 

30 several sequence-related properties of siRNAs. Figure 4 and Table IV show 

quantitative analysis for the following five sequence-related properties of siRNA: (A) 
an A at position 19 of the sense strand; (B) an A at position 3 of the sense strand; (C) 
a U at position 10 of the sense strand; (D) a base other than G at position 13 of the 
sense strand; and (E) a base other than C at position 19 of the sense strand. 
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When the siRNAs in the panel were evaluated for the presence of an A at 
position 19 of the sense strand, the percentage of non- functional duplexes decreased 
from 20% to 1 1.8%, and the percentage of F95 duplexes increased from 21.7% to 
5 29.4% (Table IV). Thus, the presence of an A in this position defined Criterion IV. 

Another sequence-related property correlated with silencing was the presence 
of an A in position 3 of the sense strand (Figure 4b). Of the siRNAs with A3, 34.4% 
were F95, compared with 21 .7% randomly selected siRNAs. The presence of a U 
10 base in position 10 of the sense strand exhibited an even greater impact (Figure 4c). 
Of the duplexes in this group, 41 .7% were F95. These properties became criteria V 
and VI, respectively. ^ 



Two negative sequence-related criteria that were identified also appear on 
15 Figure 4. The absence of a G at position 13 of the sense strand, conferred a marginal 
increase in selecting fiinctional duplexes (Figure 4d). Similarly, lack of a C at 
position 19 of the sense strand also correlated with functionality (Figure 4e). Thus, 
among fiinctional duplexes, position 19 was most likely occupied by A, and rarely 
occupied by C. These rules were defined as criteria VII and VIII, respectively. 

20 

Application of each criterion individually provided marginal but statistically 
significant increases in the probability of selecting a potent siRNA. Although the 
results were informative, the inventors soughtitojnaximize potency and therefore 
consider multiple criteria or parameters. Optimization is particularly important when 
25 developing therapeutics. Interestingly, the probability of selecting a functional siRNA 
based on each thermodynamic criteria was 2% — 4% higher than random, but 4% — 
8% higher for the sequence-related determinates. Presumably, these sequence-related 
increases reflect the complexity of the RNAi mechanism and the multitude of protein- 
RNA interactions that are involved in RNAi-mediated silencing. 

30 
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Table IV 



Impr V ment 



15 



20 



Ciit lion 


% Functional 


over Random 


1. 30%— 52% G/C content 


< F50 


16.4% 


-3.6% 




> F50 


83.6% 


3.6% 




> F80 


60,4% 


4.3% 




> F95 


23.9% 


2.2% 


II. At least 3 A/U bases at positions 


< F50 


18.2% 


-1.8% 


15 — 1 9 of the sense strand 


> F50 


81.8% 


1.8% 




> F80 


59.7% 


3.6% 




> F95 


24.0% 


2.3% 


III. Absence of internal repeats. 


<F50 


16.7% 


-3.3% 


as measured by of 


> F50 


83,3% 


3.3% 


secondary structure < 20**C 


> F80 


61.1% 


5.0% 




> F95 


24.6% 


2.9% 


IV. An A base at position 19 


<F50 


11.8% 


-8.2% 


of the sense strand 


> F50 


88.2% 


8.2% 




> F80 


75.0% 


18.9% 




> F95 


29.4% 


7.7% 


V. An A base at position 3 


<F50 


17.2% 


-2.8% 


of the sense strand 


> F50 


82.8% 


2.8% 




> F80 


62.5% 


6.4% 




> F95 


34.4% 


12.7% 


VI. A U base at position 10 


<F50 


13.9% 


-6.1% 


of the sense strand 


>F50 


86.1% 


6.1% 




>F80 


69.4% 


13.3% 




> F95 


41 .7% 


20% 


VII. A base other than C at 


<F50 


18.8% 


-1.2% 


position 1 9 of the sense strand 


>F50 


81 .2% 


1.2% 




>F80 


59.7% 


3.6% 




>F95 


24.2% 


2.5% 


VIM. A base other than G at 


<F50 


15.2% 


-4.8% 


position 1 3 of the sense strand 


> F50 


84.8% 


4.8% 




>F80 


61.4% 


5.3% 




>F95 


26.5% 


4.8% 



25 

The siRNA selection algorithm 

In an effort to improve selection further, all identified criteria, including but 
not limited to those listed in Table IV were combined into the algorithms embodied in 
Formula VIII and Formula IX. Each siRNA was then assigned a score (referred to as 
30 a SMARTscore™) according to the values derived from the formulas. Duplexes that 
scored higher than 0 or 20, for Formulas VIII and EX, respectively, effectively 
selected a set of functional siRNAs and excluded all non- functional siRNAs. 
Conversely, all duplexes scoring lower than 0 and 20 according to formulas VIII and 
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IX, respectively, contained some functional siRNAs but included aU non-functional 
siRNAs. A graphical representation of this selection is shown in Figure 5. 

The methods for obtaining the seven criteria embodied in Table IV are 
5 illustrative of the results of the process used to develop the information for Formulas 
VIII and IX. Thus similar techniques were used to establish the other variables and 
their multipliers. As described above, basic statistical methods were use to determine 
the relative values for these multipliers. 

10 To determine the value for "Improvement over Random" the difference in the 

frequency of a given attribute (e,g, GC content, base preference) at a particular 
position is determined between individual functional groups {e.g. <F50) and the total 
siRNA population studied (e.g. 270 siRNA molecules selected randomly). Thus, for 
instance, in Criterion I (30%-52% GC content) members of the <F50 group were 

15 observed to have GC contents between 30-52% in 16.4% of the cases. In contrast, the 
total group of 270 siRNAs had GC contents in this range, 20% of the time. Thus for 
this particular attribute, there is a small negative correlation between 30%-52% GC 
content and this functional group (/.e. 16.4%-20% = -3.6%). Similarly, for Criterion 
VI, (a "U" at position 10 of the sense strand), the >F95 group contained a "U" at this 

20 position 41 :7% of the time. In contrast, the total group of 270 siRNAs had a "U" at 

this position 21 .7% of the time, thus the improvement over random is calculated to be 
20% (or 41.7%-21.7%). 

Identifying The Average Intemal Stability Profile of Strong siRNA 
25 In order to identify an intemal stability profile that is characteristic of strong 

siRNA, 270 different siRNAs derived from the cyclophilin B, the diazepam binding 
inhibitor (DBI), and the luciferase gene were individually transfected into HEK293 
cells and tested for their ability to induce RNAi of the respective gene. Based on their 
performance in the in vivo assay, the sequences were then subdivided into three 
30 groups, (i) >95% silencing; (ii) 80-95% silencing; and (iii) less than 50% silencing. 
Sequences exhibiting 51-84% silencing were eliminated from further consideration to 
reduce the difficulties in identifying relevant thermodynamic pattems. 
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Following the division of siRNA into three groups, a statistical analysis was 
performed on each member of each group to determine the average internal stability 
profile (AISP) of the siRNA. To accomplish this the Oligo 5.0 Primer Analysis 
Software and other related statistical packages (e,g. Excel) were exploited to 
5 determine the internal stability of pentamers using the nearest neighbor method 

described by Freier et al., (1986) Improved free-energy parameters for predictions of 
RNA duplex stability, ProcNatl. Acad. Sci. U. S. A. 83(24): 9373-7. Values for each 
group at each position were then averaged, and the resulting data were graphed on a 
linear coordinate system with the Y-axis expressing the AG (free energy) values in 
10 kcal/mole and the X-axis identifying the position of the base relative to the 5' end. 

The results of the analysis identified Ifiultiple key regions in siRNA molecules 
that were critical for successful gene silencing. At the 3 '-most end of the sense strand 
(5'antisense), highly functional siRNA (>95% gene silencing, see Figure .6a, >F95) 

15 have a low internal stability (AISP of position 19 = -7.6kcal/mol). In contrast low- 
efficiency siRNA (z.e. those exhibiting less than 50% silencing, <F50) display a 
distinctly different profile, having high AG values (- -8.4kcal/mol) for the same 
position. Moving in a 5' (sense strand) direction, the intemal stability of highly 
efficient siRNA rises (position 12 = - -8.3kcal/mole) and then drops again (position 7 

20 = ~ -7.7kcal/iriol) before leveling off at a value of approximately -8.1kcal/mol for the 
5' terminus. SiRNA with poor silencing capabilities show a distinctly different 
profile. While the AISP value at position 12 is nearly identical with that of strong 
siRNAs, the values at positions 7 and 8 rise considerably^' peaking at a high of ~ -9.0 
kcal/moL In addition, at the 5' end of the molecule the AISP profile of strong and 

25 weak siRNA differ dramatically. Unlike the relatively strong values exhibited by 

siRNA in the >95% silencing group, siRNAs that exhibit poor silencing activity have 
weak AISP values (-7.6, -7.5, and -7.5 kcal/mol for positions 1, 2 and 3 respectively). 

Overall the profiles of both strong and weak siRNAs form distinct sinusoidal 
30 shapes that are roughly 1 80° out-of-phase with*each other. While these 

thermodynamic descriptions define the archetypal profile of a strong siRNA, it will 
likely be the case that neither the AG values given for key positions in the profile or 
the absolute position of the profile along the Y-axis {i.e. the AG -axis) are absolutes. 
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Profiles that are shifted upward or downward (Le. having on an average, higher or 
lower values at every position) but retain the relative shape and position of the profile 
along the X-axis can be foreseen as being equally effective as the model profile 
described here. Moreover, it is likely that siRNA that have strong or even stronger 

5 gene-specific silencing effects might have exaggerated AG values (either higher or 
lower) at key positions. Thus, for instance, it is possible that the 5'-most position of 
the sense strand (position 19) could have AG values of 7.4 kcal/mol or lower and still 
be a strong siRNA if, for instance, a G-C -> G-T/U mismatch were substituted at 
position 19 and altered duplex stability. Similarly, position 12 and position 7 could 

10 have values above 8.3 kcal/mol and below 7.7 kcal/mole, respectively, without 

abating the silencing effectiveness of the molecule. Thus, for instance, at position 12, 
a stabilizing chemical modification (e.g. a chemical modification of the 2' position of 
the sugar backbone) could be added that increases the average internal stability at that 
position. Similarly, at position 7, mismatches similar to those described previously 

1 5 could be introduced that would lower the AG values at that position. 

Lastly, it is important to note that while functional and non-functional siRNA 
were originally defined as those molecules having specific silencing properties, both 
broader or more limiting parameters can be used to define these molecules. As used 

20 herein, unless otherwise specified, "non-functional siRNA" are defined as those 

siRNA that induce less than 50% (<50%) target silencing, "semi-functional siRNA" 
induce 50-79% target silencing, "functional siRNA" are molecules that induce 80- 
95% gene silencing, and "highly- functional siRNA" are molecules that induce great 
than 95% gene silencing. These definitions are not intended to be rigid and can vary 

25 depending upon the design and needs of the application. For instance, it is possible 
that a researcher attempting to map a gene to a chromosome using a functional assay, 
may identify an siRNA that reduces gene activity by only 30%. While this level of 
gene silencing may be "non-functional" for e.g. therapeutic needs, it is sufficient for 
gene mapping purposes and is, under these uses and conditions, "functional." For 

30 these reasons, functional siRNA can be defined as those molecules having greater 
than 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, or 90% silencing capabilities at 
lOOnM transfection conditions. Similarly, depending upon the needs of the study 
and/or application, non-functional and semi-functional siRNA can be defined as 
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having different parameters. For instance, semi-functional siRNA can be defined as 
being those molecules that induce 20%, 30%, 40%, 50%, 60%, or 70% silencing at 
lOOnM transfection conditions. Similarly, non-functional siRNA can be defined as 
being those molecules that silence gene expression by less than 70%, 60%, 50%, 
5 40%, 30%, or less. Nonetheless, unless otherwise stated, the descriptions stated in 
the "Definitions" section of this text should be applied. 

Functional attributes can be assigned to each of the key positions in the AISP 
of strong siRNA. The low 5' (sense strand) AISP values of strong siRNAs may be 
10 necessary for determining which end of the molecule enters the RISC complex. In 

contrast, the high and low AISP values observed in the central regions of the molecule 
may be critical for siRNA-target mRNA interactions and product release, 
respectively. 

15 If the AISP values described above accurately define the thermodynamic 

parameters of strong siRNA, it would be expected that similar patterns would be 
observed in strong siRNA isolated fi-om nature. Natural siRNAs exist in a harsh, 
RNase-rich environment and it can be hypothesized that only those siRNA that 
exhibit heightened affinity for RISC (i.e, siRNA that exhibit an average intemal 

20 stability profile similar to those observed in strong siRNA) would survive in an 
intracellular environment. This hypothesis was tested using GFP-specific siRNA 
isolated fi-om N. benthamiana. Llave et al, (2002) Endogenous and Silencing- 
Associated Small RNAs in Plants, The Plant Cell 14, 1605T.16Jv9,antroduced long 
double-stranded GFP -encoding RNA into plants and subsequently re-isolated GFP- 

25 specific siRNA ft-om the tissues. The AISP of fifty-nine of these GFP-siRNA were 
determined, averaged, and subsequently plotted alongside the AISP profile obtained 
from the cyclophilin B/DBI/ luciferase siRNA having >90% silencing properties 
(Figure 6b). Comparison of the two groups show that profiles are nearly identical. 
This finding validates the information provided by the intemal stability profiles and 

30 demonstrates that: (1) the profile identified by analysis of the cyclophilin B/DBI/ 

luciferase siRNAs are not gene specific; and (2) AISP values can be used to search for 
strong siRNAs in a variety of species. 
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Both chemical modifications and base-pair mismatches can be incorporated 
into siRNA to aher the duplex's AISP and functionality. For instance, introduction of 
mismatches at positions 1 or 2 of the sense strand destabilized the 5'end of the sense 
strand and increases the functionality of the molecule (see Luc, Figure 7). Similarly, 
5 addition of 2'-0-methyl groups to positions 1 and 2 of the sense strand can also alter 
the AISP and (as a result) increase both the functionality of the molecule and 
eliminate off-target effects that results fi-om sense strand homology with the unrelated 
targets (Figures 8a, 8b). 

10 Rationale for Criteria in a Biological Context 

The fate of siRNA in the RNAi pathway may be described in 5 major steps: 
(1) duplex recognition and pre-RISC complex formation; (2) ATP-dependent duplex 
unwinding/strand selection and RISC activation; (3) mRNA target identification; (4) 
mRNA cleavage, and (5) product release (Figure 1). Given the level of nucleic acid- 

15 protein interactions at each step, siRNA functionality is likely influenced by specific 
biophysical and molecular properties that promote efficient interactions within the 
context of the multi-component complexes. Indeed, the systematic analysis of the 
siRNA test set identified multiple factors that correlate well with functionality. When 
combined into a single algorithm, they proved to be very effective in selecting active 

20 siRNAs. ; 



The factors described here may also be predictive of key functional 
associations important for each step in RNAi. For example, the potential^formation of 
internal hairpin structures correlated negatively with siRNA functionality. 

25 Complementary strands with stable internal repeats are more likely to exist as stable 
hairpins thus decreasing the effective concentration of the functional duplex form. 
This suggests that the duplex is the preferred conformation for initial pre-RISC 
association. Indeed, although single complementary strands can induce gene 
silencing, the effective concentration required is at least two orders of magnitude 

30 higher than that of the duplex form. 



siRNA-pre-RISC complex formation is followed by an ATP-dependent duplex 
imwinding step and "activation" of the RISC. The siRNA functionality was shown to 
correlate with overall low intemal stability of the duplex and low internal stability of 
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the 3' sense end (or differential internal stability of the 3' sense compare to the 5' 
sense strand), which may reflect strand selection and entry into the RISC. Overall 
duplex stability and low intemal stability at the 3 ' end of the sense strand were also 
correlated with siRNA functionality. Interestingly, siRNAs with very high and very 
5 low overall stability profiles correlate strongly with non- functional duplexes. One 
interpretation is that high intemal stability prevents efficient unwinding while very 
low stability reduces siRNA target affinity and subsequent mRNA cleavage by the 
RISC. 



10 Several criteria describe base preferences at specific positions of the sense 

strand and are even more intriguing when considering their potential mechanistic 
roles in target recognition and mRNA cleavage. Base preferences for A at position 19 
of the sense strand but not C, are particularly interesting because they reflect the same 
base preferences observed for naturally occurring miRNA precursors. That is, among 

15 the reported miRNA precursor sequences 75% contain a U at position 1 which 

corresponds to an A in position 19 of the sense strand of siRNAs,, while G was under- 
represented in this same position for miRNA precursors. These observations support 
the hypothesis that both miRNA precursors and siRNA duplexes are processed by 
very similar if not identical protein machinery. The functional interpretation of the 

20 predominance of a U/A base,pair is that it promotes flexibility at the 5'antisense ends 
of both siRNA duplexes and miRNA precursors and facilitates efficient unwinding 
iand selective strand entrance into an activated RISC. 

Among the criteria associated with base preferences that are likely to 
25 influence mRNA cleavage or possibly product release, the preference for U at 
position 1 0 of the sense strand exhibited the greatest impact, enhancing the 
probability of selecting an F80 sequence by 13.3%. Activated RISC preferentially 
cleaves target mRNA between nucleotides 10 and 1 1 relative to the 5' end of the 
complementary targeting strand. Therefore, it may be that U, the preferred base for 
30 most endoribonucle^es, at this position supports more efficient cleavage. 

Alternatively, a U/A bp between the targeting siRNA strand and its cognate target 
mRNA may create an optimal conformation for the RISC-Eissociated "slicing" 
activity. 
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Pooling 

According to another embodiment, the present invention provides a pool of at 
least two siRNAs, preferably in the form of a kit or therapeutic reagent, wherein one 
strand of each of the siRNAs, the sense strand comprises a sequence that is 
5 substantially similar to a sequence within a target mRNA. The opposite strand, the 
antisense strand, will preferably comprise a sequence that is substantially 
complementary to that of the target mRNA. More preferably, one strand of each 
siRNA will comprise a sequence that is identical to a sequence that is contained in the 
target mRNA. Most preferably, each siRNA will be 19 base pairs in length, and one 
10 strand of each of the siRNAs will be 100% complementary to a portion of the target 
mRNA. 

By increasing the number of siRNAs directed to a particular target using a 
pool or kit, one is able both to increase the likelihood that at least one siRNA with 

1 5 satisfactory functionality will be included, as well as to benefit from additive or 

synergistic effects. Further, when two or more siRNAs directed against a single gene 
do not have satisfactory levels of functionality alone, if combined, they may 
satisfactorily promote degradation of the target messenger RNA and successfully 
inhibit translation. By including multiple siRNAs in the system, not only is the 

20 probability of silencing increased,7but the economics of operation are also improved 
when compared to adding different siRNAs sequentially. This effect is contrary to the 
conventional wisdom that the concurrent use of multiple siRNA will negatively 
impact gene silencing (e.g. Holen, T. et al (2003) "Similar behavior o£single:strand 
and double strand siRNAs suggests they act through a common RNAi pathway." 

25 NAR 31: 2401-21407). 



In fact, when two siRNAs were pooled together, 54% of the pools of two 
siRNAs induced more than 95% gene silencing. Thus, a 2.5-fold increase in the 
percentage of functionality was achieved by randomly combining two siRNAs. 
30 Further, over 84% of pools containing two siRNAs induced more than 80% gene 
silencing. 

More preferably, the kit is comprised of at least three siRNAs, wherein one 
strand of each siRNA comprises a sequence that is substantially similar to a sequence 
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of the target mRNA and the other strand comprises a sequence that is substeintially 
complementary to the region of the target mRNA. As with the kit that comprises at 
least two siRNAs, more preferably one strand will comprise a sequence that is 
identical to a sequence that is contained in the mRNA and another strand that is 100% 
5 complementary to a sequence that is contained in the mRNA. During experiments, 
when three siRNAs were combined together, 60% of the pools induced more than 
95% gene silencing and 92% of the pools induced more than 80% gene silencing. 

Further, even more preferably, the kit is comprised of at least four siRNAs, 
10 wherein one strand of each siRNA comprises a sequence that is substantially similar 
to a region of the sequence of the target mRNA, and the other strand comprises a 
sequence that is substantially complementary to the region of the target mRNA. As 
with the kit or pool that comprises at least two siRNAs, more preferably one strand of 
each of the siRNA duplexes will comprise a sequence that is identical to a sequence 
15 that is contained in the mRNA, and another strand that is 100% complementary to a 
sequence that is contained in the mRNA. 

Additionally, kits and pools with at least five, at least six, and at least seven 
siRNAs may also be useful with the present invention. For example, pools of five 
20 siRNA induced 95% gene silencingjwith 77% probability and 80% silencing with 
98.8% probability. Thus, pooling of siRNAs together can result in the creation of a 
target-specific silencing reagent with almost a 99% probability of being functional. 
-^,.x.2ai=:^s*^ The fact that such high levels of success are achievable using such pools of siRNA,^^. 

enables one to dispense with costly and time-consuming target-specific validation 
25 procedures. 

For this embodiment, as well as the other aforementioned embodiments, each 
of the siRNAs within a pool will preferably comprise between 18 and 30 base pairs, 
more preferably between 1 8 and 25 base pairs, and most preferably 19 base pairs. 
30 Within each siRNA, preferably at least 1 8- contiguous bases of the antisense strand 
will be 100% complementary to the target mRNA. More preferably, at least 19 
contiguous bases of the antisense strand will be 100% complementary to the target 
mRNA. Additionally, there may be overhangs on either the sense strand or the 
antisense strand, and these overhangs may be at either the 5' end or the 3' end of 
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either of the strands, for example there may be one or more overhangs of 1 -6 bases. 
When overhangs are present, they are not included in the calculation of the number of 
base pairs. The two nucleotide 3' overhangs mimic natural siRNAs and are 
commonly used but are not essential. Preferably, the overhangs should consist of two 
5 nucleotides, most often dTdT or UU at the 3' end of the sense and antisense strand 
that are not complementary to the target sequence. The siRNAs may be produced by 
any method that is now known or that comes to be known for synthesizing double 
stranded RNA that one skilled in the art would appreciate would be useful in the 
present invention. Preferably, the siRNAs will be produced by Dharmacon's 
10 proprietary ACE® technology. However, other methods for synthesizing siRNAs are 
well known to persons skilled in the art and include, but are not limited to, any 
chemical synthesis of RNA oligonucleotides, ligation of shorter oligonucleotides, in 
vitro transcription of RNA oligonucleotides, the use of vectors for expression within 
cells, recombinant Dicer products and PGR products. 

15 

The siRNA duplexes within the aforementioned pools of siRNAs may 
correspond to overlapping sequences within a particulai- mRNA, or non-overlapping 
sequences of the mRNA. However, preferably they correspond to non-overlapping 
sequences. Further, each siRNA may be selected randomly, or one or more of the 
20 siRNA may be selected according to the:eriteria discussed above for maximizing the 
effectiveness of siRNA. 

T^:t:».v!2£j.c.r.- Included in the definition of siRNAs are siRNAs that contain substituted -^v ::vv ^ 

and/or labeled nucleotides that may, for example, be labeled by radioactivity, 
25 fluorescence or mass. The most common substitutions are at the 2' position of the 

ribose sugar, where moieties such as H (hydrogen) F, NH3, OCH3 and other O- alkyl, 
alkenyl, alkynyl, and orthoesters, may be substituted, or in the phosphorous backbone, 
where sulfur, amines or hydrocarbons may be substituted for the bridging of non- 
bridging atoms jn the phosphodiester bond. Examples of modified siRNAs are 
30 explained more fiiUy in commonly assigned U.S. Patent Application Ser. No. 
10/613,077, filed July 1, 2003, which is incorporated by reference herein. 

Additionally, as noted above, the cell type into which the siRNA is introduced 
may affect the ability of the siRNA to enter the cell; however, it does not appear to 
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affect the ability of the siRNA to function once it enters the cell. Methods for 
introducing double-stranded RNA into various cell types are well known to persons 
skilled in the art. 



5 As persons skilled in the art are aware, in certain species, the presence of 

proteins such as RdRP, the RNA-dependent RNA polymerase, may catalytically 
enhance the activity of the siRNA. For example, RdRP propagates the RNAi effect in 
C elegans and other non-mammalian organisms. In fact, in organisms that contain 
these proteins, the siRNA may be inherited. Two other proteins that are well studied 
10 and known to be a part of the machinery are members of the Argonaute family and 
Dicer, as well as their homologues. There is also initial evidence that the RISC 
complex might be associated with the ribosome so the more efficiently translated 
mRNAs will be more susceptible to silencing than others. 



1 5 Another very important factor in the efficacy of siRNA is mRNA localization. 

In general, only cytoplasmic mRNAs are considered to be accessible to RNAi to any 
appreciable degree. However, appropriately designed siRNAs, for example, siRNAs 
modified with intemucleotide linkages, may be able to cause silencing by acting in the 
nucleus. Examples of these types of modifications are described in commonly 

20 assigned U.S. Patent Application Serial Nosr^l 0/43 1,027 and 10/613,077, each of 
which is incorporated by reference herein. 

As described above, even when one selects at least two siRNAs at random,..the 
effectiveness of the two may be greater than one would predict based on the 

25 effectiveness of two individual siRNAs. This additive or synergistic effect is 
particularly noticeable as one increases to at least three siRNAs, and even more 
noticeable eis one moves to at least four siRNAs. Surprisingly, the pooling of the non- 
functional and semi-functional siRNAs, particularly more than five siRNAs, can lead 
to a silencing mixture that is as effective if not more effective than any one particular 

30 functional siRNA. 
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Within the kit of the present invention, preferably each siRNA will be present 
in a concentration of between 0.001 and 200 \xM, more preferably between 0.01 and 
200 nM, and most preferably between 0.1 and 10 nM. 



5 In addition to preferably comprising at least four or five siRNAs, the kit of the 

present invention will also preferably comprise a buffer to keep the siRNA duplex 
stable. Persons skilled in the art are aware of buffers suitable for keeping siRNA 
stable. For example, the buffer may be comprised of 100 mM KCl, 30 mM HEPES- 
pH 7.5, and 1 mM MgCb. Alternatively, kits might contain complementary strands 
10 that contain any one of a number of chemical modifications (e.g, a 2'-0-ACE) that 
protect the agents fi-om degradation by nucleases. In this instance, the user may (or 
may not) remove the modifying protective group {e.g. deprotect) before annealing the 
two complementary strands together. 

15 By way of example, the kit may be organized such that pools of siRNA 

duplexes are provided on an array or microarray of wells or drops for a particular 
gene set or for unrelated genes. The array may, for example, be in 96 wells, 384 wells 
or 1284 wells arrayed in a plastic plate or on a glass slide using techniques now 
known or that come to be known to persons skilled in the art. Within an array, 

20 preferably there will be controls such as fimctionafl anti-lamin A/C, cyclophilin and 
two siRNA duplexes that are not specific to the gene of interest. 

• In order to ensure stability of the siRNA pools prior to usage, they may be 

retained in lyophilized form at minus twenty degrees (— 20°C) until they are ready for 

25 . use. Prior to usage, they should be resuspended; however, even once resuspended, for 
example, in the aforementioned buffer, they should be kept at minus twenty degrees, 
(— 20°C) until used. The aforementioned buffer, prior to use, may be stored at 
approximately 4''C or room temperature. Effective temperatures at which to conduct 
transfections are well known to persons skilled in the art and include for example, 

30 room temperature. 

The kit may be applied either in vivo or in vitro. Preferably, the siRNA of the 
pools or kits is applied to a cell through transfection, employing standard transfection 
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protocols. These methods are well known to persons skilled in the art and include the 
use of lipid-based carriers, electroporation, cationic carriers, and microinjection. 
Further, one could apply the present invention by synthesizing equivalent DNA 
sequences (either as two separate, complementary strands, or as hairpin molecules) 
5 instead of siRNA sequences and introducing them into cells through vectors. Once in 
the cells, the cloned DNA could be transcribed, thereby forcing the cells to generate 
the siRNA. Examples of vectors suitable for use with the present application include 
but are not limited to the standard transient expression vectors, adenoviruses, 
retroviruses, lentivirus-based vectors, as well as other traditional expression vectors. 
10 Any vector that has an adequate siRNA expression and procession module may be 
used. Furthermore, certain chemical modifications to siRNAs, including but not 
limited to conjugations to other molecules, may be used to facilitate delivery. For 
certain applications it may be preferable to deliver molecules without transfection by 
simply formulating in a physiological acceptable solution. 

15 

This embodiment may be used in connection with any of the aforementioned 
embodiments. Accordingly, the sequences within any pool may be selected by 
rational design. 

20 

Multigene SUencing 

In addition to developing kits that contain multiple siRNA directed against a 
jrsingle gene, another embodiment includes the use of multiple siRNA targeting : ^kir^t^ 

multiple genes. Multiple genes may be targeted through the use of high- or hyper- 
25 functional siRNA. High- or hyper- functional siRNA that exhibit increased potency, 
require lower concentrations to induce desired phenotypic (and thus therapeutic) 
effects. This circumvents RISC saturation. It therefore reasons that if lower 
concentrations of a single siRNA are needed for knockout or knockdown expression 
of one gene, then the remaining (uncomplexed) RISC will be firee and available to 
30 interact with siRNA directed against two, three, four, or more, genes. Thus in this 
embodiment, the authors describe the use of highly functional or hyper-functional 
siRNA to knock out three separate genes. More preferably, such reagents could be 
combined to knockout four distinct genes. Even more preferably, highly functional or 
. hyperfunctional siRNA could be used to knock out five distinct genes. Most 
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preferably, siRNA of this type could be used to knockout or knockdown the 
expression of six or more genes. 

Hvperfunctional siRNA 

5 The term hyperfunctional siRNA (hf-siRNA) describes a subset of the siRNA 

population that induces RNAi in cells at low- or sub-nanomolar concentrations for 
extended periods of time. These traits, heightened potency and extended longevity of 
the RNAi phenotype, are highly attractive from a therapeutic standpoint. Agents 
having higher potency require lesser amounts of the molecule to achieve the desired 
10 physiological response, thus reducing the probability of side effects due to "off- 
target" interference. In addition to the potential therapeutic benefits associated with 
hyperfunctional siRNA, hf-siRNA are also desirable from an economic position. 
Hyperfunctional siRNA may cost less on a per-treatment basis, thus reducing the 
overall expenditures to both the manufacturer and the consumer. 

15 Identification of hyperfunctional siRNA involves multiple steps that are 

designed to examine an individual siRNA agent's concentration- and/or longevity- 
profiles. In one non-limiting example, a population of siRNA directed against a single 
gene are first analyzed using the previously described algorithm (Formula VIII). 
Individual siRNA are then introduced into a test cell line and assessed for the ability 

20 to degrade the target mRNA. It is important to note that when performing this step it 
is not necessary to test all of the siRNA. Instead, it is sufficient to test only those 
siRNA having the highest SMARTscores™ (/.e. SMARTscore™ > -10). 
Subsequently, the gene silencing data is plotted against the SMARTscores™ (see 
Figure 9). SiRNA that (1) induce a high degree of gene silencing (i.e. they induce 

25 greater than 80% gene knockdown) and (2) have superior SMARTscores™ (i.e, a 

SMARTscore^^ of > -10, suggesting a desirable average internal stability profile) are 
selected for further investigations designed to better understand the molecule's 
potency and longevity. In one, non-limiting study dedicated to understanding a 
molecule's potency, an siRNA is introduced into one (or more) cell types in 

30 increasingly diminishing concentrations (e.g. 3.0 ^ 0.3 nM). Subsequently, the level 
of gene silencing induced by each concentration is examined and siRNA that exhibit 
hyperfunctional potency (i,e. those that induce 80% silencing or greater at e.g. 
picomolar concentrations) are identified. In a second study, the longevity profiles of 
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siRNA having high (>-10) SMARTscores™ and greater than 80% silencing are 
examined. In one non-hmiting example of how this is achieved, siRNA are introduced 
into a test cell line and the levels of RNAi are measured over an extended period of 
time {e.g. 24-168 hrs). SiRNAs that exhibit strong RNA interference patterns {i.e. 
5 >80 % interference) for periods of time greater than, e.g., 120 hours are thus 

identified. Studies similar to those described above can be performed on any and all 
of the >10^ siRNA included in this document to further define the most fiinctional 
molecule for any given gene. Molecules possessing one or both properties (extended 
longevity and heightened potency) are labeled "hyperfunctional siRNA," and 
10 earmarked as candidates for future therapeutic studies. 

While the example(s) given above describe one means by which 
hyperfiinctional siRNA can be isolated, neither the assays themselves nor the 
selection parameters used are rigid and can vary with each family of siRNA. Families 
15 of siRNA include siRNAs directed against a single gene, or directed against a related 
family of genes. 

The highest quality siRNA achievable for any given gene may vary 
considerably. Thus, for example, in the case of one gene (gene X), rigorous studies 

20 such as those described above may enable the identification.of an siRNA that, at 
picomolar concentrations, induces 99"*^% silencing for a period of 10 days. Yet 
identical studies of a second gene (gene Y) may yield an siRNA that at high 
nanomolar concentrations {e.g. lOOnM) induces only 75% silencing for a period of 2 
days. Both molecules represent the very optimum siRNA for their respective gene 

25 targets and therefore are designated "hj^erfunctional." Yet due to a variety of factors 
including but not limited to target concentration, siRNA stability, cell type, off-target 
interference, and others, equivalent levels of potency and longevity are not 
achievable. Thus, for these reasons, the parameters described in the before mentioned 
assays, can vary. While the initial screen selected siRNA that had SMARTscores™ 

30 above -10 and a gene silencing capability of greater than 80%, selections that have 
stronger (or weaker) parameters can be implemented. Similarly, in the subsequent 
studies designed to identify molecules with high potency and longevity, the desired 
cutoff criteria {i.e. the lowest concentration that induces a desirable level of 
interference, or the longest period of time that interference can be observed) can vary. 
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The experimentation subsequent to application of the rational criteria of this 
application is significantly reduced where one is trying to obtain a suitable 
hyperfunctional siRNA for, for example, therapeutic use. When, for example, the 
additional experimentation of the type described herein is applied by one skilled in the 
5 art with this disclosure in hand, a hyperfunctional siRNA is readily identified. 

The siRNA may be introduced into a cell by any method that is now known or 
that comes to be known and that fi-om reading this disclosure, persons skilled in the 
art would determine would be usefiil in connection with the present invention in 

10 enabUng siRNA to cross the cellular membrane. These methods include, but are not 
limited to, any memner of transfection, such as for example transfection employing 
DEAE-Dextran, calcium phosphate, cationic lipids/liposomes, micelles, manipulation 
of pressure, microinjection, electroporation, immunoporation, use of vectors such as 
viruses, plasmids, cosmids, bacteriophages, cell fusions, and coupling of the 

1 5 polynucleotides to specific conjugates or ligands such as antibodies, antigens, or 

receptors, passive introduction, adding moieties to the siRNA that facilitate its uptake, 
and the like. 

Having described the invention with a degree of particularity, examples will 
20 now be provided. These examples are not intended to and should not be construed to 
limit the scope of the claims in any way. 

i*iwi^-,i£jjLit^ Examples 

25 General Techniques and Nomenclatures 

siRNA nomenclature. All siRNA duplexes are referred to by sense strand. The first 
nucleotide of the 5 '-end of the sense strand is position 1, which corresponds to 
position 19 of the antisense strand for a 19-mer. In most cases, to compare results 
firom different experiments, silencing was determined by measuring specific transcript 
30 mRNA levels or enzymatic activity associated with specific transcript levels, 24 hours 
post-transfection, with siRNA concentrations held constant at 100 nM. For all 
experiments, unelss otherwise specified transfection efficiency was ensured to be over 
95%, and no detectable cellular toxicity was observed. The following system of 
nomenclature was used to compare and report siRNA-silencing functionality: "F" 
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followed by the degree of minimal knockdown. For example, F50 signifies at least 
50% knockdown, F80 means at least 80%, and so forth. For this study, all sub-F50 
siRNAs were considered non-functional. 

Cell culture and transfection. 96-well plates are coated with 50 |xl of 50 mg/ml poly- 
L-lysine (Sigma) for 1 hr, and then washed 3X with distilled water before being dried 
for 20 min. HEK293 cells or HEK293Lucs or any other cell type of interest are 
released from their solid support by trypsinization, diluted to 3.5 X 10^ cells/ml, 
followed by the addition of 100 |aL of cells/well. Plates are then incubated overnight 
at 37*^ C, 5% CO2. Transfection procedures can vary widely depending on the cell 
type and transfection reagents. In one non-limiting example, a transfection mixture 
consisting of 2 mL Opti-MEM I (Gibco-BRL), 80 ^1 Lipofectamine 2000 
(Invitrogen), 15 |iL SUPERNasin at 20 U/|al (Ambion), and 1.5 |il of reporter gene 
plasmid at 1 \ig/ \il is prepared in 5-ml polystyrene round bottom tubes. 100 jal of 
transfection reagent is then combined with 100 |xl of siRNAs in polystyrene deep-well 
titer plates (Beckman) and incubated for 20 to 30 min at room temp. 550 jal of Opti- 
MEM is then added to each well to bring the final siRNA concentration to 100 nM. 
Plates are then sealed with parafilm and mixed. Media is removed from HEK293 
cells and replaced with 95 |xl of transfection mixture. Cells are incubated ovemight at 
37^ C, 5% CO2. 

Quantification of gene knockdown. A variety of quantification procedures can be 
used to measure the level of silencing induced by siRNA or siRNA pools. In one non- 
limiting example: to measure mRNA levels 24 hrs post-transfection, QuantiGene 
25 branched-DNA (bDNA) kits (Bayer) (Wang, et al Regulation of insulin preRNA 
splicing by glucose, Proc Natl Acad Sci 1997, 94:4360.) are used according to 
manufacturer instructions. To measure luciferase activity, media is removed from 
HEK293 cells 24 hrs post-transfection, and 50 )il of Steady-GLO reagent (Promega) 
is added. After 5 min, plates are analyzed on a plate reader. 

30 

Example I. Sequences Used to Develop the Algorithm. 

Anti-Firefly and anti-Cyclophilin siRNAs panels (Figure 5a, b) sorted 
according to using Formula VIII predicted values. All siRNAs scoring more than 0 
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15 
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(formula VIII) and more then 20 (formula IX) are fully functional. All ninety 
sequences for each gene (and DBI) appear below in Table III. 



TABLE III 



Cyclo 


1 


SEQ. ID 0032 


Cyclo 


2 


SEQ. ID 0033 


Cyclo 


3 


SEQ. ID 0034 


Cyclo 


4 


SEQ. ID 0035 


Cyclo 


5 


SEQ. ID 0036 


Cyclo 


6 


SEQ. ID 0037 


Cyclo 


7 


SEQ. ID 0038 


Cyclo 


. 8 


SEQ. ID 0039 


Cyclo 


9 


SEQ. ID 0040 


Cyclo 


10 


SEQ. ID 0041 


Cyclo 


11 


SEQ. ID 0042 


Cyclo 


12 


SEQ. ID 0043 


Cyclo 


13 


SEQ. ID 0044 


Cyclo 


14 


SEQ. ID 0045 


Cyclo 


15 


SEQ. ID 0046 


Cyclo 


16 


SEQ. ID 0047 


Cyclo 


17 


SEQ. ID 0048 


Cyclo 


18 


SEQ. ID 0049 


Cyclo 


19 


SEQ. ID 0050 


Cyclo 


20 


SEQ. ID 0051 


Cyclo 


21 


SEQ. ID 0052 


Cyclo 


22 


SEQ. ID 0053 


Cyclo 


23 


SEQ. ID .0054 


Cyclo 


24 


SEQ. ID 0055 


Cyclo 


25 


SEQ. ID 0056 


Cyclo 


26 


SEQ. ID 0057 


Cyclo 


27 


SEQ, ID 0058 


Cyclo 


28 


SEQ. ID 0059 


Cyclo 


29 


SEQ. ID 0060 


Cyclo 


30 


SEQ. ID 0061 


Cyclo 


31 


SEQ. ID 0062 


Cyclo 


32 


SEQ. ID 0063 


Cyclo 


33 


SEQ. ID 0064 


Cyclo 


34 


SEQ. ID 0065 



GUUCCAAAAACAGUGGAUA 

UCCAAAAACAGUGGAUAAU 

CAAAAACAGUGGAUAAUUU 

AAAACAGUGGAUAAUUUUG 

AACAGUGGAUAAUUUUGUG 

CAGUGGAUAAUUUUGUGGC 

GUGGAUAAUUUUGUGGCCU 

GGAUAAUUUUGUGGCCUUA 

AUAAUUUUGUGGCCUUAGC 

AAUUUUGUGGCCUUAGCUA 

UUUUGUGGCCUUAGCUACA 

UUGUGGCCUUAGCUACAGG 

GUGGCCUUAGCUACAGGAG 

GGCCUUAGCUACAGGAGAG 

CCUUAGCUACAGGAGAGAA 

UUAGCUACAGGAGAGAAAG 

AGCUACAGGAGAGAAAGGA 

CUACAGGAGAGAAAGGAUU 

ACAGGAGAGAAAGGAUUUG 

AGGAGAGAAAGGAUUUGGC 

GAGAGAAAGGAUUUGGCUA 

GAGAAAGGAUUUGGCUACA 

GAAAGGAUUUGGCUACAAA 

AAGGAUUUGGCUACAAAAA 

GGAUUUGGCUACAAAAACA 

AUUUGGCUACAAAAACAGC 

UUGGCUACAAAAACAGCAA 

GGCUACAAAAACAGCAAAU 

CUACAAAAACAGCAAAUUC 

ACAAAAACAGCAAAUUCCA 

AAAAACAGCAAAUUCCAUC 

AAACAGCAAAUUCCAUCGU 

ACAGCAAAUUCCAUCGUGU 

AGCAAAUUCCAUCGUGUAA 



Cyclo 


35 


SEQ. ID 0066 


CAAAUUCCAUCGUGUAAUC 


Cyclo 


36 


SEQ. ID 0067 


AAUUCCAUCGUGUAAUCAA 


Cyclo 


37 


SEQ. ID 0068 


UUCCAUCGUGUAAUCAAGG 


Cyclo 


38 


SEQ. ID 0069 


CCAUCGUGUAAUCAAGGAC 


Cyclo 


39 


SEQ. ID 0070 


AUCGUGUAAUCAAGGACUU 


Cyclo 


40 


SEQ. ID 0071 


CGUGUAAUCAAGGACUUCA 


Cyclo 


41 


SEQ. ID 0072 


UGUAAUCAAGGACUUCAUG 


Cyclo 


42 


SEQ. ID 0073 


UAAUCAAGGACUUCAUGAU 


Cyclo 


43 


SEQ. ID 0074 


AUCAAGGACUUCAUGAUCC 


Cyclo 


44 


SEQ. ID 0075 


CAAGGACUUCAUGAUCCAG 


Cyclo 


45 


SEQ. ID 0076 


AGGACUUCAUGAUCCAGGG 


Cyclo 


46 


SEQ. ID 0077 


GACUUCAUGAUCCAGGGCG 


Cyclo 


47 


SEQ. ID 0078 


CUUCAUGAUCCAGGGCGGA 


Cyclo 


48 


SEQ. ID 0079 


UCAUGAUCCAGGGCGGAGA 


Cyclo 


49 


SEQ. ID 0080 


AUGAUCCAGGGCGGAGACU 


i_/ycio 


DV 


oxlV^. kU UUo 1 


OA U i^i^ ALrLivji.^vjO A vj AC U U U 


Cyclo 


51 


SEQ. ID 0082 


UCCAGGGCGGAGACUUCAC 


Cyclo 


52 


SEQ. ID 0083 


CAGGGCGGAGACUUCACCA 


Cyclo 


53 


SEQ. ID 0084 


GGGCGGAGACUUCACCAGG 


Cyclo 


54 


SEQ. ID 0085 


GCGGAGACUUCACCAGGGG 


Cyclo 


55 


SEQ. ID 0086 


GGAGACUUCACCAGGGGAG 


Cyclo 


- 56 


SEQ. ID 0087 


AGACUUCACCAGGGGAGAU 


Cyclo 


57 


SEQ. ID 0088 


ACUUCACCAGGGGAGAUGG 


Cyclo 


58 


SEQ. ID 0089 


UUCACCAGGGGAGAUGGCA 


Cyclo 


59 


SEQ. ID 0090 


CACCAGGGGAGAUGGCACA 


Cyclo 


60 


SEQ, ID 0091 


CCAGGGGAGAUGGCACAGG 


Cyclo 


61 


SEQ. ID 0092 " 


AGGGGAGAUGGCACAGGAG 


Cyclo 


62 


SEQ. ID 0093 


GGGAGAUGGCACAGGAGGA 


Cyclo 


63 


SEQ. ID 0094 


GAGAUGGCACAGGAGGAAA 


Cyclo 


64 


SEQ. ID 0095 


GAUGGCACAGGAGGAAAGA 


Cyclo 


65 


SEQ. ID 0094 


UGGCACAGGAGGAAAGAGC 


Cyclo 


66 


SEQ. ID 0096 


GCACAGGAGGAAAGAGCAU 


Cyclo 


67 


SEQ. ID 0097 


ACAGGAGGAAAGAGCAUCU 


Cyclo 


68 


SEQ. ID 0098 


AGGAGGAAAGAGCAUCUAC 


Cyclo 


69 


SEQ. ID 0099 


GAGGAAAGAGCAUCUACGG 


Cyclo 


70 


SEQ. ID 0100 


GGAAAGAGCAUCUACGGUG 


Cyclo 


71 


SEQ. ID 0101 


AAAGAGCAUCUACGGUGAG 


Cyclo 


72 


SEQ. ID 0102 


AGAGCAUCUACGGUGAGCG 


Cyclo 


73 


SEQ. ID 0103 


AGCAUCUACGGUGAGCGCU 



Cyclo 


74 


SEQ. ID0104 


CAUCUACGGUGAGCGCUUC 


Cyclo 


75 


SEQ.ID0105 


UCUACGGUGAGCGCUUCCC 


Cycld 


76 


SEQ. ID 0106 


UACGGUGAGCGCUUCCCCG 


Cyclo 


77 


SEQ. ID 0107 


CGGUGAGCGCUUCCCCGAU 


Cyclo 


78 


SEQ. ID 0108 


GUGAGCGCUUCCCCGAUGA 


Cyclo 


79 


SEQ. ID 0109 


GAGCGCUUCCCCGAUGAGA 


Cyclo 


80 


SEQ. ID 01 10 


GCGCUUCCCCGAUGAGAAC 


Cyclo 


81 


SEQ. ID 0111 


GCUUCCCCGAUGAGAACUU 


Cyclo 


82 


SEQ. ID 01 12 


UUCCCCGAUGAGAACUUCA 


Cyclo 


83 


SEQ. ID 01 13 


CCCCGAUGAGAACUUCAAA 


Cyclo 


84 


SEQ. ID 01 14 


CCGAUGAGAACUUCAAACU 


Cyclo 


85 


SEQ. ID 01 15 


GAUGAGAACUUCAAACUGA 


Cyclo 


86 


SEQ. ID 01 16 


UGAGAACUUCAAACUGAAG 


Cyclo 


87 


. SEQ. ID 01 17 


AGAACUUCAAACUGAAGCA 


Cyclo 


88 


SEQ. ID 01 18 


AACUUCAAACUGAAGCACU 


Cyclo 


89 


SEQ. ID 01 19 


CUUCAAACUGAAGCACUAC 


Cyclo 


90 


SEQ. ID 0120 


UCAAACUGAAGCACUACGG 


DB 


1 


SEQ. ID 0121 


ACGGGCAAGGCCAAGUGGG 


DB 


2 


SEQ. ID 0122 


CGGGCAAGGCCAAGUGGGA 


DB 


3 


SEQ. ID 0123 


GGGCAAGGCCAAGUGGGAU 


DB 


4 


SEQ. ID 0124 


GGCAAGGCCAAGUGGGAUG 


DB 


5 


SEQ. ID 0125 


GCAAGGCCAAGUGGGAUGC 


DB 


6 


SEQ. ID 0126 


CAAGGCCAAGUGGGAUGCC 


DB 


7 


SEQ. ID 0127 


AAGGCCAAGUGGGAUGCCU 


DB 


8 


SEQ. ID 0128 


AGGCCAAGUGGGAUGCCUG 


DB 


9 


SEQ. ID 0129 


GGCCAAGUGGGAUGCCUGG 


DB 


10 


SEQ. ID 0130 


GCCAAGUGGGAUGCCUGGA 


DB 


11 


SEQ. ED 0131 


CCAAGUGGGAUGCCUGGAA 


DB 


12 


SEQ. ID 0132 


CAAGUGGGAUGCCUGGAAU 


DB 


13 


SEQ. ID 0133 


AAGUGGGAUGCCUGGAAUG 


DB 


14 


SEQ. ID 0134 


AGUGGGAUGCCUGGAAUGA 


DB 


15 


SEQ. ID 0135 


GUGGGAUGCCUGGAAUGAG 


DB 


16 


SEQ. ID 0136 


UGGGAUGCCUGGAAUGAGC 


DB 


17 


SEQ. ID 0137 


GGGAUGCCUGGAAUGAGCU 


DB 


18 


SEQ. ID 0138 


GGAUGCCUGGAAUGAGCUG 


DB 


19 


SEQ. ID 0139 


GAUGCCUGGAAUGAGCUGA 


DB 


20 


SEQ. ID 0140 


AUGCCUGGAAUGAGCUGAA 


DB 


21 


SEQ. ID 0141 


UGCCUGGAAUGAGCUGAAA 


DB 


22 


SEQ. ID 0142 


GCCUGGAAUGAGCUGAAAG 


DB 


23 


SEQ. ID 0143 


CCUGGAAUGAGCUGAAAGG 



DB 


24 


SEQ. ID 0144 


CUGGAAUGAGCUGAAAGGG 


DB 


25 


SEQ. ID 0145 


UGGAAUGAGCUGAAAGGGA 


DB 


26 


SEQ. ID 0146 


GGAAUGAGCUGAAAGGGAC 


DB 


27 


SEQ. ID 0147 


GAAUGAGCUGAAAGGGACU 


DB . 


28 


SEQ. ID 0148 


AAUGAGCUGAAAGGGACUU 


DB 


29 


SEQ. ID 0149 


AUGAGCUGAAAGGGACUUC 


DB 


30 


SEQ. ID 0150 


UGAGCUGAAAGGGACUUCC 


DB 


31 


SEQ. ID 0151 


GAGCUGAAAGGGACUUCCA 


DB 


32 


SEQ. ID 0152 


AGCUGAAAGGGACUUCCAA 


DB 


33 


SEQ. ID 0153 


GCUGAAAGGGACUUCCAAG 


DB 


34 


SEQ. ID 0154 


CUGAAAGGGACUUCCAAGG 


DB 


35 


SEQ. ID 0155 


UGAAAGGGACUUCCAAGGA 


DB 


36 


SEQ. ID 0156 


GAAAGGGACUUCCAAGGAA 


DB 


37 


SEQ. ID 0157 


AAAGGGACUUCCAAGGAAG 


DB 


38 


SEQ. ID 0158 


AAGGGACUUCCAAGGAAGA 


DB 


39 


SEQ. ID 0159 


AGGGACUUCCAAGGAAGAU 


DB 


40 


SEQ. ID 0160 


GGGACUUCCAAGGAAGAUG 


DB 


41 


SEQ. ID 0161 


GGACUUCCAAGGAAGAUGC 


DB 


42 


SEQ. ID 0162 


GACUUCCAAGGAAGAUGCC 


DB 


43 


SEQ. ID 0163 


ACUUCCAAGGAAGAUGCCA 


DB 


44 


SEQ. ID 0164 


CUUCCAAGGAAGAUGCCAU 


DB 


45 


SEQ. ID 0165 


UUCCAAGGAAGAUGCCAUG 


DB 


46 


SEQ. ID 0166 


UCCAAGGAAGAUGCCAUGA 


DB 


47 


SEQ. ID 0167 


CCAAGGAAGAUGCCAUGAA 


DB 


48 


SEQ. ID 0168 


CAAGGAAGAUGCCAUGAAA 


DB 


49 


SEQ. ID 0169 


AAGGAAGAUGCCAUGAAAG 


DB 


50 


SEQ. ID 0170 


AGGAAGAUGCCAUGAAAGC 


DB 


51 


SEQ. ID 0171 


GGAAGAUGCCAUGAAAGCU 


DB 


52 


SEQ. ID 0172 


GAAGAUGCCAUGAAAGCUU 


DB 


53 


SEQ. ID 0173 


AAGAUGCCAUGAAAGCUUA 


DB 


54 


SEQ. ID 0174 


AGAUGCCAUGAAAGCUUAC 


DB 


55 


SEQ. ID 0175 


GAUGCCAUGAAAGCUUACA 


DB 


56 


SEQ. ID 0176 


AUGCCAUGAAAGCUUACAU 


DB 


57 


SEQ. ED 0177 


UGCCAUGAAAGCUUACAUC 


DB 


58 


SEQ. ID 0178 


GCCAUGAAAGCUUACAUCA 


DB 


59 


SEQ. ID 0179 


CCAUGAAAGCUUACAUCAA 


DB 


60 


SEQ. ID 0180 


CAUGAAAGCUUACAUCAAC 


DB 


61 


SEQ. ID 0181 


AUGAAAGCUUACAUCAACA 


DB 


62 


SEQ. ID 0182 


UGAAAGCUUACAUCAACAA 


DB 


63 


SEQ. ID 0183 


GAAAGCUUACAUCAACAAA 



DB 


64 


SEQ. ID0184 


AAAGCUUACAUCAACAAAG 


DB 


65 


SEQ.ro 0185 


AAGCUUACAUCAACAAAGU 


DB 


66 


SEQ.ro 0186 


AGCUUACAUCAACAAAGUA 


DB 


67 


SEQ. ro 0187 


GCUUACAUCAACAAAGUAG 


DB 


68 


SEQ. ro 0188 


CUUACAUCAACAAAGUAGA 


DB 


69 


SEQ. ro 0189 


UUACAUCAACAAAGUAGAA 


DB 


70 


SEQ. ro 0190 


UACAUCAACAAAGUAGAAG 


DB 


71 


SEQ. ro 0191 


ACAUCAACAAAGUAGAAGA 


DB 


72 


SEQ. ro 0192 


CAUCAACAAAGUAGAAGAG 


DB 


73 


SEQ. ro 0193 


AUCAACAAAGUAGAAGAGC 


DB 


74 


SEQ. ro 0194 


UCAACAAAGUAGAAGAGCU 


DB 


75 


SEQ. ro 0195 


CAACAAAGUAGAAGAGCUA 


DB 


76 


SEQ. ro 0196 


AACAAAGUAGAAGAGCUAA 


DB 


77 


SEQ. ro.0197 


ACAAAGUAGAAGAGCUAAA 


DB 


78 


SEQ. ro 0198 


CAAAGUAGAAGAGCUAAAG 


DB 


79 


SEQ. ro 0199 


AAAGUAGAAGAGCUAAAGA 


DB 


80 


SEQ. ro 0200 


AAGUAGAAGAGCUAAAGAA 


DB 


81 


SEQ. ro 0201 


AGUAGAAGAGCUAAAGAAA 


DB 


82 


SEQ. ro 0202 


GUAGAAGAGCUAAAGAAAA 


DB 


83 


SEQ. ID 0203 


UAGAAGAGCUAAAGAAAAA 


DB 


84 


SEQ, ro 0204 


AGAAGAGCUAAAGAAAAAA 


DB 


85 


SEQ. ro 0205 


GAAGAGCUAAAGAAAAAAU 


DB 


86 


SEQ. ro 0206 


AAGAGCUAAAGAAAAAAUA 


DB 


87 


SEQ. ro 0207 


AGAGCUAAAGAAAAAAUAC 


DB 


88 


SEQ. ID 0208 


GAGCUAAAGAAAAAAUACG 


DB 


89 


SEQ, ro 0209 


AGCUAAAGAAAAAAUACGG 


DB 


90 


SEQ. ID 0210 


GCUAAAGAAAAAAUACGGG 


Luc 


1 


SEQ. ro 0211 


AUCCUCAUAAAGGCCAAGA 


Luc 


2 


SEQ. ID 0212 


AGAUCCUCAUAAAGGCCAA 


Luc 


3 


SEQ. ID 0213 


AGAGAUCCUCAUAAAGGCC 


Luc 


4 


SEQ. ID 0214 


AGAGAGAUCCUCAUAAAGG 


Luc 


5 


SEQ.ro 0215 


UCAGAGAGAUCCUCAUAAA 


Luc 


6 


SEQ. ID 0216 


AAUCAGAGAGAUCCUCAUA 


Luc 


7 


SEQ. ro 0217 


AAAAUCAGAGAGAUCCUCA 


Luc 


8 


SEQ.ro 0218 


GAAAAAUCAGAGAGAUCCU 


Luc 


9 


SEQ.ro 0219 


AAGAAAAAUCAGAGAGAUC 


Luc 


10 


SEQ. ro 0220 


GCAAGAAAAAUCAGAGAGA 


Luc 


11 


SEQ. ro 0221 


ACGCAAGAAAAAUCAGAGA 


Luc 


12 


SEQ, ro 0222 


CGACGCAAGAAAAAUCAGA 


Luc 


13 


SEQ. ro 0223 


CUCGACGCAAGAAAAAUCA 
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Luc 


14 


SEQ, ID 0224 


AACUCGACGCAAGAAAAAU 


Luc 


15 


SEQ. ID 0225 


AAAACUCGACGCAAGAAAA 


Luc 


16 


SEQ. ID 0226 


GGAAAACUCGACGCAAGAA 


Luc 


17 


SEQ. ID 0227 


CCGGAAAACUCGACGCAAG 


Luc 


18 


SEQ. ID 0228 


UACCGGAAAACUCGACGCA 


Luc 


19 


SEQ. ID 0229 


CUUACCGGAAAACUCGACG 


Luc 


20 


SEQ. ID 0230 


GUCUUACCGGAAAACUCGA 


Luc 


21 


SEQ. ID 0231 


AGGUCUUACCGGAAAACUC 


Luc 


22 


SEQ. ID 0232 


AAAGGUCUUACCGGAAAAC 


Luc 


23 


SEQ. ID 0233 


CGAAAGGUCUUACCGGAAA 


Luc 


24 


SEQ. ID 0234 


ACCGAAAGGUCUUACCGGA 


Luc 


25 


SEQ. ID 0235 


GUACCGAAAGGUCUUACCG 


Luc 


26 


SEQ. ID 0236 


AAGUACCGAAAGGUCUUAC 


Luc 


27 


SEQ. ID 0237 . 


CGAAGUACCGAAAGGUCUU 


Luc 


28 


SEQ. ID 0238 


GACGAAGUACCGAAAGGUC 


Luc 


29 


SEQ. ID 0239 


UGGACGAAGUACCGAAAGG 


Luc 


30 


SEQ, ID 0240 


UGUGGACGAAGUACCGAAA 


Luc 


31 


SEQ. ID 0241 


UUUGUGGACGAAGUACCGA 


Luc 


32 


SEQ. ID 0242 


UGUUUGUGGACGAAGUACC 


Luc 


33 


SEQ. ID 0243 


UGUGUUUGUGGACGAAGUA 


Luc 


34 


SEQ. ID 0244 


GLTUGUGUUUGUGGACGAAG 


Luc 


35 


SEQ. ID 0245 


GAGUUGUGUUUGUGGACGA 


Luc 


36 


SEQ. ID 0246 


AGGAGUUGUGUUUGUGGAC 


Luc 


37 


SEQ. ID 0247 


GGAGGAGUUGUGUUUGUGG 


Luc 


38 


SEQ. ID 0248 


GCGGAGGAGUUGUGUUUGU 


Luc 


39 


SEQ. ID 0249 


GCGCGGAGGAGUUGUGUUU 


Luc 


40 


SEQ. ID 0250 


UUGCGCGGAGGAGUUGUGU 


Luc 


41 


SEQ. ID 0251 


AGUUGCGCGGAGGAGUUGU 


Luc 


42 


SEQ. ID 0252 


AAAGUUGCGCGGAGGAGUU 


Luc 


43 


SEQ. ID 0253 


AAAAAGUUGCGCGGAGGAG 


Luc 


44 


SEQ. ID 0254 


CGAAAAAGUUGCGCGGAGG 


Luc 


45 


SEQ. ID 0255 


CGCGAAAAAGUUGCGCGGA 


Luc 


46 


SEQ. ID 0256 


ACCGCGAAAAAGUUGCGCG 


Luc 


47 


SEQ. ID 0257 


CAACCGCGAAAAAGUUGCG 


Luc 


48 


SEQ, ID 0258 


AACAACCGCGAAAAAGUUG 


Luc 


49 


SEQ. ID 0259 


GUAACAACCGCGAAAAAGU 


Luc 


50 


SEQ. ID 0260 


AAGUAACAACCGCGAAAAA 


Luc 


51 


SEQ. ID 0261 


UCAAGUAACAACCGCGAAA 


Luc 


52 


SEQ. ID 0262 


AGUCAAGUAACAACCGCGA 


Luc 


53 


SEQ. ID 0263 


CCAGUCAAGUAACAACCGC 



Luc 


54 


SEQ. ID 0264 


Luc 


55 


SEQ. ID 0265 


Luc 


56 


SEQ. ID 0266 


Luc 


57 


SEQ. ID 0267 


Luc 


58 


SEQ. ID 0268 


Luc 


59 


SEQ. ID 0269 


Luc 


60 


SEQ. ID 0270 


Luc 


61 


SEQ. ED 0271 


Luc 


62 


SEQ. ID 0272 


Luc 


63 


SEQ. ID 0273 


Luc 


64 


SEQ. ID 0274 


Luc 


65 


SEQ. ID 0275 


Luc 


66 


SEQ. ID 0276 


Luc 


67 


SEQ. ED 0277 


Luc 


68 


SEQ. ID 0278 


Luc 


69 


SEQ. ID 0279 


Luc 


70 


SEQ. ID 0280 


Luc 


71 


SEQ. ID 0281 


Luc 


72 


SEQ. ID 0282 


Luc 


73 


SEQ. ID 0283 


Luc 


74 


SEQ. ID 0284 


Luc 


75 


SEQ. ID 0285 


Luc 


76 


SEQ. ID 0286 


Luc 


77 


SEQ. ID 0287 


Luc 


78 


SEQ. ID 0288 


Luc 


79. 


SEQ. ID 0289 


Luc 


80 


SEQ. ID 0290 


Luc 


81 


SEQ. ID 0291 


Luc 


82 


SEQ. ID 0292 


Luc 


83 


SEQ. ID 0293 


Luc 


84 


SEQ. ID 0294 


Luc 


85 


SEQ. ID 0295 


Luc 


86 


SEQ. ED 0296 


Luc 


87 


SEQ. ID 0297 


Luc 


88 


SEQ. ID 0298 


Luc 


89 


SEQ. ID 0299 


Luc 


90 


SEQ. ID 0300 



CGCCAGUCAAGUAACAACC 

GUCGCCAGUCAAGUAACAA 

ACGUCGCCAGUCAAGUAAC 

UUACGUCGCCAGUCAAGUA 

GAUUACGUCGCCAGUCAAG 

UGGAUUACGUCGCCAGUCA 

CGUGGAUUACGUCGCCAGU 

AUCGUGGAUUACGUCGCCA 

AGAUCGUGGAUUACGUCGC 

AGAGAUCGUGGAUUACGUC 

AAAGAGAUCGUGGAUUACG 

AAAAAGAGAUCGUGGAUUA 

GGAAAAAGAGAUCGUGGAU 

ACGGAAAAAGAGAUCGUGG 

UGACGGAAAAAGAGAUCGU 

GAUGACGGAAAAAGAGAUC 

ACGAUGACGGAAAAAGAGA 

AGACGAUGACGGAAAAAGA 

AAAGACGAUGACGGAAAAA 

GGAAAGACGAUGACGGAAA 

ACGGAAAGACGAUGACGGA 

GCACGGAAAGACGAUGACG 

GAGCACGGAAAGACGAUGA 

UGGAGCACGGAAAGACGAU 

UUUGGAGCACGGAAAGACG 

GUUUUGGAGCACGGAAAGA 

UUGUUUUGGAGCACGGAAA 

UGUUGUUUUGGAGCACGGA 

GUUGUUGUUUUGGAGCACG 

CCGUUGUUGUUUUGGAGCA 

CGCCGUUGUUGUUIRJGGAG 

GCCGCCGUUGUUGUUUUGG 

CCGCCGCCGUUGUUGUUUU 

UCCCGCCGCCGUUGUUGUU 

CUUCCCGCCGCCGUUGUUG 

AACUUCCCGCCGCCGUUGU 

UGAACUUCCCGCCGCCGUU 



63 



Example II. Validation of the Algorithm using DBI, Luciferase, PLK, EGFR, 
and SEAP 

The algorithm (Formula VIII) identified siRNAs for five genes, human DBI, 
firefly luciferase (fLuc), renilla luciferase (rLuc), human PLK, and human secreted 
5 alkaline phosphatase (SEAP). Four individual siRNAs were selected on the basis of 
their SMARTscores^^ derived by analysis of their sequence using Formula VIII (all 
of the siRNAs would be selected with Formula IX as well) and analyzed for their 
ability to silence their targets' expression. In addition to the scoring, a BLAST search 
was conducted for each siRNA. To minimize the potential for off-target silencing 

10 effects, only those target sequences with more than three mismatches against un- 
related sequences were selected. Semizarov, et al, Specificity of short interfering RNA 
determined through gene expression signatures, Proc. Natl. Acad. Sci. U.S.A. 2003, 
100:6347. These duplexes were analyzed individually and in pools of 4 and 
compared with several siRNAs that were randomly selected. The functionality was 

1 5 measured a percentage of targeted gene knockdown as compared to controls. All 

siRNAs were transfected as described by the methods above at 100 nM concentration 
into HEK293 using Lipofectamine 2000. The level of the targeted gene expression 
was evaluated by B-DNA as described above and normalized to the non-specific 
control. Figure 10 shows that the siRNAs selected by the algorithm disclosed herein 

20 were significantly more potent than randomly selected siRNAs. The algorithm 

increased the chances of identifying an F50 siRNA firom 48% to 91%, and an F80 
siRNA firom 13% to 57%. In addition, pools of SMART siRNA silence the selected 
target better than randomly selected pools (see Figure 1 OF). 

25 Example III. Validation of the Algorithm Using Genes Involved in Clathrin- 
Dependent Endocytosis. 

Components of clathrin-mediated endocytosis pathway are key to modulating 
intracellular signaling and play important roles in disease. Chromosomal 
rearrangements that result in fusion transcripts between the Mixed-Lineage Leukemia 
30 gene (MLL) and CALM (Clathrin assembly lymphoid myeloid leukemia gene) are 
believed to play a role in leukemogenesis. Similarly, disruptions in Rab7 and Rab9, 
as well as HIPl (Hxmtingtin-interacting protein), genes that are believed to be 
involved in endocytosis, are potentially responsible for ailments resulting in lipid 
storage, and neuronal diseases, respectively. For these reasons, siRNA directed 
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against clathrin and other genes involved in the clathrin-mediated endocytotic 
pathway are potentially important research and therapeutic tools. 

siRNAs directed against genes involved in flie clathrin-mediated endocytosis 
5 pathways were selected using Formula VIIL The targeted genes were clathrin heavy 
chain (CHC, accession # NM_004859), clathrin light chain A (CLCa, NM_001833), 
clathrin light chain B (CLCb, NM__001834), CALM (U45976), (32 subunit of AP-2 
(P2, NM__001282), Epsl5 (NM__001981X EpslSR (NM_021235), dynamin II 
(DYNII, NM_004945), RabSa (BC001267), RabSb (NM_002868), RabSc 
10 (AF141304), and EEA.l (XM_018197). 

For each gene, four siRNAs duplexes with the highest scores were selected 
and a BLAST search was conducted for each of them using the Human EST database. 
In order to minimize the potential for off-target silencing effects, only those 

1 5 sequences with more than three mismatches against un-related sequences were used. 
All duplexes were synthesized at Dharmacon, Inc. as 21-mers with 3'-UU overhangs 
using a modified method of 2'-ACE chemistry Scaringe, Advanced S'-silyl-l'- 
orthoester approach to RNA oligonucleotide synthesis. Methods Enzymol 2000, 317:3 
and the antisense strand was chemically phosphorylated to insure maximized activity. 

20 ^. 

HeLa cells were grown in Dulbecco's modified Eagle's medium (DMEM) 
containing 10% fetal bovine serum, antibiotics and glutamine. siRNA duplexes were 
resuspended in IX siRNA Universal buffer (Dharmac6ri;Tnc.) to 20)aM prior to 
transfection. HeLa cells in 12-well plates were transfected twice with 4|al of 20|aM 

25 siRNA duplex in 3|xl Lipofectamine 2000 reagent (Invitrogen, Carlsbad, California, 
USA) at 24-hour intervals. For the transfections in which 2 or 3 siRNA duplexes were 
included, the amount of each duplex was decreased, so that the total amount was the 
same as in transfections with single siRNAs. Cells were plated into normal culture 
medium 12 hours prior to experiments, and protein levels were measured 2 or 4 days 

30 after the first transfection. 

Equal amounts of lysates were resolved by electrophoresis, blotted, and 
stained with the antibody specific to targeted protein, as well as antibodies specific to 
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unrelated proteins, PPl phosphatase and TsglOl (not shown). The cells were lysed in 
Triton X-lOO/glycerol solubilization buffer as described previously. Tebar, 
Bohlander, & Sorkin, Clathrin Assembly Lymphoid Myeloid Leukemia (CALM) 
Protein: Localization in Endocytic-coated Pits, Interactions with Clathrin, and the 
5 Impact of Overexpression on Clathrin-mediated Traffic, Mol. Biol. Cell Aug 1999, 
10:2687. Cell lysates were electrophoresed, transferred to nitrocellulose membranes, 
and Westem blotting was performed with several antibodies followed by detection 
using enhanced chemiluminescence system (Pierce, Inc). Several x-ray films were 
analyzed to determine the linear range of the chemiluminescence signals, and the 
10 quantifications were performed using densitometry and Alphalmager v5.5 software 
(Alpha Innotech Corporation). Li experiments with EpslSR-targeted siRNAs, cell 
lysates were subjected to immunoprecipitation with Ab860, and EpslSR was detected 
in inununoprecipitates by Westem blotting as described above. 

15 The antibodies to assess the levels of each protein by Westem blot were 

obtained fi-om the following sources: monoclonal antibody to clathrin heavy chain 
(TD.l) was obtained firom American Type Culture Collection (Rockville, MD, USA); 
polyclonal antibody to dynamin II was obtained firom Affinity Bioreagents, Inc. 
(Golden, CO, USA); monoclonal antibodies to EEA.l and RabSa were purchased 

20 fi-om BD Transduction Laboratories (Los Angeles, CA, USA); the monoclonal 

antibody to TsglOl was purchased firom Santa Cruz Biotechnology, Inc. (Santa Cruz, 
CA, USA); the monoclonal antibody to GFP was firom ZYMED Laboratories Inc. 
(South San Francisco, CA, USA); the rabbit polyclonaKantibodies Ab32 specific to a- 
adaptins and Ab20 to CALM were described previously Sorkin, et al, Stoichiometric 

25 Interaction of the Epidermal Growth Factor Receptor with the Clathrin-associated 

Protein Complex AP'2, J, BioL Chem, Jan 1995, 270:619, the polyclonal antibodies to 
clathrin light chains A and B were kindly provided by Dr. F. Brodsky (UCSF); 
monoclonal antibodies to PPl (BD Transduction Laboratories) and a-Actinin 
(Chemicon) were kindly provided by Dr. M. DeU'Acqua (University of Colorado); 

30 Epsl5 Ab577 and EpslSR Ab860 were kindly provided by Dr. P.P. Di Fiore 
(European Cancer Institute). 
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Figure 11 demonstrates the in vivo functionality of 48 individual siRNAs, 
selected using Formula VIII (most of them will meet the criteria incorporated by 
Formula IX as well) targeting 12 genes. Various cell lines were transfected with 
siRNA duplexes {Dupl'4) or pools of siRNA duplexes (Pool), and the cells were 
5 lysed 3 days after transfection with the exception of CALM (2 days) and P2 (4 days). 

Note a pl-adaptin band (part of AP-1 Golgi adaptor complex) that mns 
slightly slower than p2 adaptin. CALM has two splice variants, 66 and 72 kD. The 
full-length EpslSR (a doublet of -130 kD) and several truncated spliced forms of - 

10 1 00 kD and --70 kD were detected in Epsl 5R immunoprecipitates (shown by arrows). 
The cells were lysed 3 days after transfection. Equal amounts of lysates were 
resolved by electrophoresis and blotted with the antibody specific to a targeted protein 
(GFP antibody for YFP fusion proteins) and the antibody specific to unrelated 
proteins PPl phosphatase or a-actinin, and TSGIOI. The amount of protein in each 

1 5 specific band was normalized to the amount of non-specific proteins in each lane of 
the gel. Nearly all of them appear to be functional, which establishes that Formula 
VIII and IX can be used to predict siRNAs' functionality in general in a genome wide 
manner. 

20 To genef afe the fusion of yellow fluorescent protein (YFP) with RabSb or 

RabSc (YFP-Rab5b or YFP-Rab5c), a DNA fragment encoding the full-length human 
RabSb or Rab5c was obtained by PGR using Pfu polymerase (Stratagene) with a Sad 
restriction site introduced into the 5' end and SiKpnl siteH'ntoMM 3 ' end and cloned 
into pEYFP-Cl vector (CLONTECH, Palo Alto, CA, USA). GFP-CALM and YFP- 

25 RabSa were described previously Tebar, Bohlander, & Sorkin, Clathrin Assembly 

Lymphoid Myeloid Leukemia (CALM) Protein: Localization in Endocytic-coated Pits, 
Interactions with Clathrin, and the Impact of Overexpression on Clathrin-mediated 
Traffic, Mol. Biol. Cell Aug 1999, 10:2687. 

30 Example III. Validation of the Algorithm Using Eg5, GADPH, ATEl, MEK2, 
MEKl, QB, LaminA/C, c-myc, human cyclophilin, and mouse cyclophilin. 
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A number of genes have been identified as playing potentially important roles 
in disease etiology. Expression profiles of normal and diseased kidneys has 
implicated Edg5 in immunoglobulin A neuropathy, a conmion renal glomerular 
disease. Mycl, MEKl/2 and other related kinases have been associated with one or 
5 more cancers, while lamins have been implicated in muscular dystrophy and other 
diseases. For these reasons, siRNA directed against the genes encoding these classes 
of molecules would be important research and therapeutic tools. 

Figure 12 illustrates four siRNAs targeting 10 different genes (Table V for 
10 sequence and accession number information) that were selected according to the 
Formula VIII and assayed as individuals and pools in HEK293 cells. The level of 
siRNA induced silencing was measured using the B-DNA assay. These studies 
demonstrated that thirty-six out of the forty individual SMART-selected siRNA tested 
are functional (90%) and all 10 pools are fully functional. 

15 

Example V. Validation of the Algorithm Using Bcl2 

Bcl-2 is a ~25kD, 205-239 amino acid, anti-apoptotic protein that contains 
considerable homology with other members of the BCL family including BCLX, 
MCLl, BAX, BAD, and BIK. The protein exists in at least two forms (Bcl2a, which 

20 has a hydrophobic tail for membrane anchorage, and Bcl2b, which lacks the 

hydrophobic tail) and is predominantly localized to the mitochondrial membrane. 
While Bcl2 expression is widely distributed, particular interest has focused on the 
expression of this molecule in B and T cells. Bcl2 expressionas:down-regulated in 
normal germinal center B cells yet in a high percentage of follicular lymphomas, Bcl2 

25 expression has been observed to be elevated. Cytological studies have identified a 

common translocation ((14;18)(q32;q32)) amongst a high percentage (>70%) of these 
lymphomas. This genetic lesion places the Bcl2 gene in juxtaposition to 
immunoglobulin heavy chain gene (IgH) encoding sequences and is believed to 
enforce inappropriate levels of gene expression, and resistance to programmed cell 

30 death in the follicle center B cells. In other cases, hypomethylation of the Bcl2 

promoter leads to enhanced expression and again, inhibition of apoptosis. In addition 
to cancer, dysregulated expression of Bcl-2 has been correlated with multiple sclerosis 
and various neurological diseases. 
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The correlation between Bcl-2 translocation and cancer makes this gene an 
attractive target for RNAi. Identification of siRNA directed against the bcl2 transcript 
(or Bcl2-IgH fusions) would further our understanding Bcl2 gene function and 
possibly provide a future therapeutic agent to battle diseases that result firom altered 
5 expression or function of this gene. 

In Silico Identification of Functional siRNA 

To identify functional and hyperfunctional siRNA against the Bcl2 gene, the 
sequence for Bcl-2 was downloaded from the NCBI Unigene database and analyzed 
10 using the Formula VIII algorithm. As a result of these procedures, both the sequence 
and SMARTscores™ of the Bcl2 siRNA were obtained and ranked according to their 
functionality. Subsequently, these sequences were BLAST' ed (database) to insure that 
the selected sequences were specific and contained minimal overlap with unrealated 
genes. The SMARTscores^M for the top 10 Bcl-2 siRNA are identified in Figure 13. 

15 

In Vivo Testing of Bcl-2 SiRNA 

Bcl-2 siRNAs having the top ten SMARTscores™ were selected and tested in 
a functional assay to determine silencing efficiency. To accomplish this, each of the 
ten duplexes were synthesized using 2' -O- ACE chemistry and transfected at lOOnM 
20 concentrations into cells,:-Twenty-four hours later assays were performed on cell 
extracts to assess the degree of target silencing. Controls used in these experiments 
included mock transfected cells, and cells that were transfected with a non-specific 
siRNA duplex. -^v.. -.:,i*v~^-~ 

25 The results of these experiments are presented below (and in Figure 14) and 

show that all ten of the selected siRNA induce 80% or better silencing of the Bcl2 
message at lOOnM concentrations. These data verify that the algorithm successfully 
identified functional Bcl2 siRNA and provide a set of functional agents that can be 
used in experimental and therapeutic environments. 

30 
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siRNA 
siRNA 
siRNA 
siRNA 
SiRNA 
SiRNA 
siRNA 
SiRNA 
siRNA 
siRNA 



1 GGGAGAUAGUGAUGAAGUA SEQ. IDNO. 301 

2 GAAGUACAUCCAUUAUAAG SEQ. ID NO. 302 

3 GUACGACAACCGGGAGAUA SEQ. ID NO. 303 

4 AGAUAGUGAUGAAGUACAU SEQ. ID NO. 304 

5 UGAAGACUCUGCUCAGUUU SEQ. ID NO. 305 

6 GCAUGCGGCCUCUGUUUGA SEQ. ID NO. 306 

7 UGCGGCCUCUGUUUGAUUU SEQ. ID NO. 307 

8 GAGAUAGUGAUGAAGUACA SEQ. ID NO. 308 

9 GGAGAUAGUGAUGAAGUAC SEQ. IDNO. 309 

SEQ. IDNO. 310 



10 GAAGACUCUGCUCAGUUUG 



10 



Bcl2 siRNA: Sense Strand, 5' ^3' 
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Example VI. Sequences Selected by the Algorithm 

Sequences of the siRNAs selected using Formulas (Algorithms) VIII and IX 
with their corresponding ranking, which have been evaluated for the silencing activity 
in vivo in the present study (Formula VIII and IX, respectively). 



TABLE V 



Gene 


Accession 






Formula 


Formula 


Name 


Number 


SEQ. ID NO. 


FTllSeqTence 


VIII 


IX 



CLTC 

CLTC 

CLTC 

CLTC 

CLTA 

CLTA 

CLTA 

CLTA 

CLTB 

CLTB 

CLTB 

CLTB 

CALM 

CALM 

CALM 

CALM 

EPS15 



NM_004859 

NM_004859 

NM_004859 

NM_004859 

NM_001833 

NM_001833 

NM_001833 

NM_001833 

NM_001834 

NM__001834 

NM_o6l834 

NM_001834 

U45976 

U45976 

U45976 

U45976 

NM 001981 



SEQ. roNO. 0301 
SEQ. ID NO. 0302 
SEQ. ID NO. 0303 
SEQ. ID NO. 0304 
SEQ. ID NO. 0305 
SEQ. ID NO, 0306 
SEQ. ID NO. 0307 
SEQ. ID NO. 0308 
SEQ. ID NO. 0309 
SEQ. IDNO. 0310 
SEQ. IDNO. 0311 
SEQ. IDNO. 0312 
SEQ. IDNO. 0313 
SEQ. IDNO. 0314 
SEQ. ID NO. 0315 
SEQ. IDNO. 0316 
SEQ. IDNO. 0317 



GAAAGAATCTGTAGAGAAA 76 94.2 

GCAATGAGCTGTTTGAAGA 65 39.9 

TGACAAAGGTGGATAAATT 57 38.2 

GGAAATGGATCTCTTTGAA. 54 49.4 

GGAAAGTAATGGTCCAACA 22 55.5 

AGACAGTTATGCAGCTATT 4 22.9 

CCAATTCTCGGAAGCAAGA 1 17 

GAAAGTAATGGTCCAACAG -1 -13 

GCGCCAGAGTGAACAAGTA 17 57.5 

GAAGGTGGCCCAGCTATGT 15 -8.6 

GGAACCAGCGCCAGAGTGA 13 40.5 

GAGCGAGATTGCAGGCATA 20 61.7 

GTTAGTATCTGATGACTTG 36 -34.6 

GAAATGGAACCACTAAGAA 33 46.1 

GGAAATGGAACCACTAAGA 30 61.2 

CAACTACACTTTCCAATGC 28 6.8 

CCACCAAGATTTCATGATA 48 25.2 



EPS 15 


NM_001981 


SEQ. ID NO. 0318 


GATCGGAACTCCAACAAGA 


43 


49.3 


EPS 15 


NM_001981 


SEQ.IDNO. 0319 


AAACGGAGCTACAGATTAT 


39 


11.5 


EPS15 


NM_001981 


SEQ. ID NO. 0320 


CCACACAGCATTCTTGTAA 


33 


-23.6 


EPS15R 


NM_021235 


SEQ.IDNO. 0321 


GAAGTTACCTTGAGCAATC 


48 


33 


EPS15R 


NM_021235 


SEQ. ID NO, 0322 


GGACTTGGCCGATCCAGAA 


27 


33 


EPS15R 


NM_021235 


SEQ.IDNO. 0323 


GCACTTGGATCGAGATGAG 


20 


1.3 


EPS15R 


NM_021235 


SEQ. ID NO. 0324 


CAAAGACCAATTCGCGTTA 


17 


27,7 


DNM2 


NM_004945 


SEQ. ID NO. 0325 


CCGAATCAATCGCATCTTC 


6 


-29.6 


DNM2 


NM_004945 


SEQ. ID NO. 0326 


GACATGATCCTGCAGTTCA 


5 


-14 


DNM2 


NM_004945 


SEQ. ID NO. 0327 


GAGCGAATCGTCACCACTT 


5 


24 


DNM2 


NM_004945 


SEQ. ID NO. 0328 


CCTCCGAGCTGGCGTCTAC 


-4 


-63.6 


ARF6 


AF93885 


SEQ. ID NO. 0329 


TCACATGGTTAACCTCTAA 


27 


-21.1 


ARF6 


AF93885 


SEQ. ID NO. 0330 


GATGAGGGACGCCATAATC 


7 


-38.4 


ARF6 


AF93885 


SEQ. ID NO. 0331 


CCTCTAACTACAAATCTTA 


4 


16,9 


ARF6 


AF93885 


SEQ. ID NO. 0332 


GGAAGGTGCTATCCAAAAT 


4 


11.5 


RAB5A 


BC001267 


SEQ. ID NO. 0333 


GCAAGCAAGTCCTAACATT 


40 


25,1 


RAB5A 


BC001267 


SEQ. ID NO. 0334 


GGAAGAGGAGTAGACCTTA 


17 


50.1 


RAB5A 


BC001267 


SEQ. ID NO. 0335 


AGGAATCAGTGTTGTAGTA 


16 


11.5 


RAB5A 


BC001267 


SEQ. ID NO. 0336 


GAAGAGGAGTAGACCTTAC 


12 


7 


RAB5B 


NM_002868 


SEQ. ID NO. 0337 


GAAAGTCAAGCCTGGTATT 


14 


18.1 


RAB5B 


NM_002868 


SEQ. ID NO. 0338 


AAAGTCAAGCCTGGTATTA 


6 


-17.8 


RAB5B 


NM__002868 


SEQ. ID NO. 0339 


GCTATGAACGTGAATGATC 


3 


-21.1 


RAB5B 


NM_002868 


SEQ. ID NO. 0340 


CAAGCCTGGTATTACGTTT 


-7. 


-37.5 


RAB5C 


AF141304 


SEQ.^HD NO. 0341 


GGAACAAGATCTGTCAATT 


38 


51,9 


RAB5C 


AF 14 1304 


SEQ. ID NO. 0342 


GCAATGAACGTGAACGAAA 


29 


43.7 


RAB5C 


AF141304 


SEQ. ID NO. 0343 


CAATGAACGTGAACGAAAT 


18 


43.3 


RAB5C 


AF141304 


SEQ. ID NO. 0344 


GGACAGGAGCGGTATCACA 


6 


18.2 


EEAl 


XM_018197 


SEQ, ID NO. 0345 


AGACAGAGCTTGAGAATAA 


67 


64.1 


EEAl 


XM_018197 


SEQ. ID NO. 0346 


GAGAAGATCTTTATGCAAA 


60 


48.7 


EEAl 


XM_018197 


SEQ, ID NO, 0347 


GAAGAGAAATCAGCAGATA 


58 


45.7 


EEAl 


XM_018197 


SEQ. ID NO. 0348 


GCAAGTAACTCAACTAACA 


56 


72.3 


AP2B1 


NM_001282 


SEQ. ID NO. 0349 


GAGCTAATCTGCCACATTG 


49 


-12.4 


AP2B1 


NM_001282 


SEQ. ID NO. 0350 


GCAGATGAGTTACTAGAAA 


44 


48.9 


AP2B1 


NM_001282 


SEQ.IDNO. 0351 


CAACTTAATTGTCCAGAAA 


41 


28.2 


AP2B1 


NM_001282 


SEQ. ID NO. 0352 


CAACACAGGATTCTGATAA 


33 


-5.8 


PLK 


NM_005030 


SEQ. ID NO. 0353 


AGATTGTGCCTAAGTCTCT 


-35 


-3.4 


PLK 


NM_005030 


SEQ. ID NO. 0354 


ATGAAGATCTGGAGGTGAA 


0 


-4.3 


PLK 


NM_005030 


SEQ. ID NO. 0355 


TTTGAGACTTCTTGCCTAA 


-5 


-27.7 


PLK 


NM_005030 


SEQ. ID NO. 0356 


AGATCACCCTCCTTAAATA 


15 


72.3 


GAPDH 


NM_002046 


SEQ. ID NO. 0357 


CAACGGATTTGGTCGTATT 


27 


-2.8 



71 



GAPDH 


NM_002046 


SEQ. ID NO. 0358 


GAAATCCCATCACCATCTT 


24 


3.9 


GAPDH 


NM_002046 


SEQ. ID NO. 0359 


GACCTCAACTACATGGTTT 


22 


-22.9 


GAPDH 


NM_002046 


SEQ. ID NO. 0360 


TGGTTTACATGTTCCAATA 


9 


9.8 


c-Myc 




SEQ. ID NO. 0361 


GAAGAAATCGATGTTGTTT 


31 


-11.7 


c-Myc 




SEQ. ID NO. 0362 


ACACAAACTTGAACAGCTA 


22 


51.3 


c-Myc 




SEQ. ID NO. 0363 


GGAAGAAATCGATGTTGTT 


18 


26 


c-Myc 




SEQ. ID NO. 0364 


GAAACGACGAGAACAGTTG 


18 


-8.9 


MAP2K1 


NM_002755 


SEQ. ID NO. 0365 


GCACATGGATGGAGGTTCT 


26 


16 


MAP2K1 


NM_002755 


SEQ. ID NO. 0366 


GCAGAGAGAGCAGATTTGA 


16 


0.4 


MAP2K1 


NM_002755 


SEQ. ID NO. 0367 


GAGGTTCTCTGGATCAAGT 


14 


15.5 


MAP2K1 


NM_002755 


SEQ. ID NO. 0368 


GAGCAGATTTGAAGCAACT 


14 


18.5 


MAP2K2 


NM_030662 


SEQ. ID NO. 0369 


CAAAGACGATGACTTCGAA 


37 


26.4 


MAP2K2 


NM_030662 


SEQ. ID NO. 0370 


GATCAGCATTTGCATGGAA 


24 


-0.7 


MAP2K2 


NM_030662 


SEQ. ID NO. 0371 


TCCAGGAGTTTGTCAATAA 


17 


-4.5 


MAP2K2 


NM_030662 


SEQ. ID NO. 0372 


GGAAGCTGATCCACCTTGA 


16 


59.2 


KNSL1(EG5) 


NM_004523 


SEQ. ID NO. 0373 


GCAGAAATCTAAGGATATA 


53 


35.8 


KNSL1(EG5) 


NM_004523 


SEQ. ID NO. 0374 


CAACAAGGATGAAGTCTAT 


50 


18.3 


KNSL1(EG5) 


NM_004523 


SEQ. ID NO. 0375 


CAGCAGAAATCTAAGGATA 


41 


32.7 


KNSL1(EG5) 


NM_004523 


SEQ. ID NO. 0376 


CTAGATGGCTTTCTCAGTA 


39 


3.9 


CyclophilinA_ 


NM_021130 


SEQ. ID NO. 0377 


AGACAAGGTCCCAAAGACA 


-16 


58.1 


CyclophiliiiA_ 


NM_021130 


SEQ. ID NO. 0378 


GGAATGGCAAGACCAGCAA 


-6 


36 


CyclophilinA_ 


NM_021130 


SEQ. ID NO. 0379 


AGAATTATTCCAGGGTTTA 


-3 


16.1 


CyclophilinA_ 


NM_021130 


SEQ. ID NO. 0380 


GCAGACAAGGTCCCAAAGA 


8 


8.9 


LAMIN A/C 


NM_170707 


SEQ. ID NO. 0381 


AGAAGCAGCTTCAGGATGA 


31 


38.8 


LAMINA/C 


NM_1 70707 


SEQ. ID NO. 0382 


GAGCTTGACTTCCAGAAGA 


33 


22.4 


LAMIN A/C 


NM_1 70707 


SEQ. ID NO. 0383 


CCACCGAAGTTCACCCTAA 


21 


27.5 


LAMIN A/C 


NM_1 70707 


SEQ. ID NO. 0384 


GAGAAGAGCTCCTCCATCA 


55 


30.1 


CyclophilinB 


M60857 


SEQ. ID NO. 0385 


GAAAGAGCATCTACGGTGA 


41 


83.9 


CyclophilinB 


M60857 


SEQ, ID NO. 0386 


GAAAGGATTTGGCTACAAA 


53 


59.1 


CyclophilinB 


M60857 


SEQ. ID NO. 0387 


ACAGCAAATTCCATCGTGT 


-20 


28.8 


CyclophilinB 


M60857 


SEQ. ID NO. 0388 


GGAAAGACTGTTCCAAAAA 


2 


27 


DBIl 


NM_020548 


- SEQ. ID NO. 0389 


CAACACGCCTCATCCTCTA 


27 


-7.6 


DBI2 


NM_020548 


SEQ. ID NO. 0390 


CATGAAAGCTTACATCAAC 


25 


-30.8 


DBI3 


NM_020548 


SEQ. ID NO. 0391 


AAGATGCCATGAAAGCTTA 


17 


22 


DBI4 


NM_020548 


SEQ. ID NO. 0392 


GCACATACCGCCTGAGTCT 


15 


3.9 


rLUCI 




SEQ. ID NO. 0393 


GATCAAATCTGAAGAAGGA 


57 


49.2 


rLUC2 




SEQ. ID NO. 0394 


GCCAAGAAGTTTCCTAATA 


50 


13.7 


rLUC3 




SEQ. ID NO. 0395 


CAGCATATCTTGAACCATT 


41 


-2.2 


rLUC4 




SEQ. ID NO. 0396 


GAACAAAGGAAACGGATGA 


39 


29.2 


SeAPl 


NM_031313 


SEQ. ID NO. 0397 


CGGAAACGGTCCAGGCTAT 


6 


26.9 



72 



SeAP2 


NM_031313 


SEQ. ID NO. 0398 


GCTTCGAGCAGACATGATA 


4 


-11.2 


SeAP3 


NM_031313 


SEQ. ID NO. 0399 


CCTACACGGTCCTCCTATA 


4 


4.9 


SeAP4 


NM_031313 


SEQ. ID NO. 0400 


GCCAAGAACCTCATCATCT 


1 


-9.9 


fLUCl 




SEQ. ID NO. 0401 


GATATGGGCTGAATACAAA 


54 


40.4 


fLUC2 




SEQ. ID NO. 0402 


GCACTCTGATTGACAAATA 


47 


54.7 


fLUC3 




SEQ. ID NO. 0403 


TGAAGTCTCTGATTAAGTA 


46 


34.5 


fLUC4 




SEQ. ID NO. 0404 


TCAGAGAGATCCTCATAAA 


40 


11.4 


mCyclol 


NM_008907 


SEQ. ID NO. 0405 


GCAAGAAGATCACCATTTC 


52 


46.4 


mCyclo_2 


NM_008907 


SEQ. ID NO. 0406 


GAGAGAAATTTGAGGATGA 


36 


70.7 


mCyclo_3 


NM_008907 . 


SEQ. ID NO, 0407 


GAAAGGATTTGGCTATAAG 


35 


-1.5 


mCyclo_4 


NM_008907 


SEQ. ID NO. 0408 


GAAAGAAGGCATGAACATT 


27 


10.3 


BCL2_^1 


NM_000633 


SEQ. ID NO. 0409 


GGGAGATAGTGATGAAGTA 


21 


72 


BCL2_2 


NM_000633 


SEQ. ID NO. 0410 


GAAGTACATCCATTATAAG 


1 


3.3 


BCL2_3 


NM_000633 


SEQ. ID NO. 0411 


GTACGACAACCGGGAGATA 


1 


35.9 


BCL2_4 


NM_000633 


SEQ. ID NO. 0412 


AGATAGTGATGAAGTACAT 


-12 


22.1 


BCL2_5 


NM_000633 


SEQ. ID NO. 0413 


TGAAGACTCTGCTCAGTTT 


36 


19.1 


BCL2_6 


NM_000633 


SEQ. ID NO. 0414 


GCATGCGGCCTCTGTTTGA 


5 


-9.7 


QBl 


NM_003365.1 


SEQ. ID NO, 0415 


GCACACAGCUUACUACAUC 


52 


-4.8 


QB2 


NM_003365.1 


SEQ. ID NO. 0416 


GAAAUGCCCUGGUAUCUCA 


49 


22.1 


QB3 


NM_003365.1 


SEQ. ID NO. 0417 


GAAGGAACGUGAUGUGAUC 


34 


22,9 


QB4 


NM_003365.1 


SEQ. ID NO, 0418 


GCACUACUCCUGUGUGUGA 


28 


20.4 


ATEl-1 


NM_007041 


SEQ. ID NO. 0419 


GAACCCAGCUGGAGAACUU 


45 


15.5 


ATEl-2 


NM_007041 


SEQ. ID NO. 0420 


GAUAUACAGUGUGAUCUUA 


40 


12.2 


ATEl-3 


NM_007041 


SEQ. ID NO. 0421 


GUACUACGAUCCUGAUUAU 


37 


32.9 


ATEl-4 


NM_007041 


SEQ. ID NO. 0422 


GUGCCGACCUUUACAAUUU 


35 


18.2 


EGFR-1 


NM_005228 


SEQ. ID NO. 0423 


GAAGGAAACTGAATTCAAA 


68 


79.4 


EGFR-1 


NM_005228 


SEQ. ID NO. 0424 


GGAAATATGTACTACGAAA 


49 


49.5 


EGFR-1 


NM_005228 


SEQ, ID NO. 0425 


CCACAAAGCAGTGAATTTA 


41 


7.6 


EGFR-1 


NM_005228 


SEQ. ID NO. 0426 


GTAACAAGCTCACGCAGTT 


40 


25.9 



Example VII. Genome-Wide Application of the Algorithm 

The examples described above demonstrate that the algorithm(s) can 
5 successfully identify functional siRNA and that these duplexes can be used to induce 
the desirable phenotype of transcriptional knockdown or knockout. Each gene or 
family of genes in each organism plays an important role in maintaining physiological 
homeostasis and the algorithm can be used to develop functional, highly functional, or 
hyperfunctional siRNA to each gene. To accomplish this for the human genome, the 
10 entire online ncbi refseq database was accessed through Entrez (efetch). The database 



73 



was processed through Formula VIIL For each gene the top 80 -100 scores for 
siRNAs were obtained and BLAST' ed to insure that the selected sequences are 
specific in targeting the gene of choice. These sequences are provided on the 
enclosed CDs in electronic form. 

5 

With respect to the disks, there are four tables on each disk copy in text 
format: Tables XII -XV. Table XII, which is located in a file entitled Table__12,txt, 
provides a list of the 80-100 sequences for each target, identified by Formula VIII as 
having the highest relative SMARTscores™ for the target analyzed. Table XIII, 

10 which is located in a file entitled Table l 3.txt, provides the SMARTscores™, and for 
each gene, a pool pick of up to four sequences is denoted. (The denotation of"!" in 
Table XIII means that it is a pool pick.) These pool pick sequences represent the most 
functional siRNAs for the corresponding target. Any 1, 2, 3, or 4 of the pool pick 
sequences could be used for gene silencing. Further, sequences that are not denoted 

15 as pool pick sequences, but that are included on the compact disks may also be used 
for gene silencing either alone or in combination with other sequences. However, 
their individual relative fimctionality would be less than that of a pool pick sequence. 
Table XIV, which is located in a file entitled Table_14.txt, provides an identification 
of genes by accession number, and Table XV, which is located in a file entitled 

20 Table 15.txt, provides a short name for the/genes identified on the disk. The 
information contained on the disks is part of this patent application and are 
incorporated into the specification by reference. One may use these tables in order to 
identify functional siRNAs for the gene provided therein, by simply looking for the 
gene of interest and an siRNA that is listed as functional. Preferably, one would 

25 select one or more of the siRNA that most optimized for the target of interest and is 
denoted as a pool pick. 



Table XII: siRNA selected by Formula VIII 

See data submitted herewith on a CD-ROM in accordance with PCT 
30 Administrative Instructions Section 801(a) 

Table XIII: SMARTscores™ 

See data submitted herewith on a CD-ROM in accordance with PCT 
Administrative Instructions Section 801(a) 
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Table XIV: Identification of Targets 

See data submitted herewith on a CD-ROM in accordance with PCX 
Administrative Instructions Section 801(a) 



Table XV: Description of Targts 

See data submitted herewith on a CD-ROM in accordance with PCX 
Administrative Instructions Section 801(a) 



10 Many of the genes to which the described siRNA are directed play critical 

roles in disease etiology. For this reason, the siRNA listed in the accompanying 
compact disk may potentially act as therapeutic agents. A number of prophetic 
examples follow and should be understood in view of the siRNA that are identified on 
the accompanying CD. Xo isolate these siRNA, the appropriate message sequence for 

15 each gene is analj^ed using one of the before mentioned formulas (preferably formula 
VIII) to identify potential siRNA targets. Subsequently these targets are BLASX'ed to 
eliminate homology with potentially off-targets. 



Xhe list of potential disease targets is extensive. For instance, over-expression 
20 of Bel 10 has been implicated in the developm^t of MALX lymphoma (mucosa 

associated lymphoid tissue lymphoma) and thus, functional, highly functional, or 

hyperfunctional siRNA directed against that gene (e,g, SEQ. ID NO. 0427: 

GGAAACCUCUCAUUGCUAA; SEQ. ID NO. 0428: 

GAAAGAACCUUGCCGAUCA; SEQ. ID NO. 0429: 
25 GGAAAUACAUCAGAGCUUA, or SEQ. ID NO. 0430: 

GAAAGUAUGUGUCUUAAGU) may contribute to treatment of this disorder. 

In another example, studies have shown that molecules that inhibit 
glutamine:fiructose-6-phosphate aminotransferase (GFA) may act to limit the 
30 symptoms suffered by Xype II diabetics. Xhus, functional, highly functional, or 

hyperfunctional siRNA directed against GFA (also known as GFPXl : siRNA = SEQ. 
ID NO. 0433 UGAAACGGCUGCCUGAUUU; SEQ. ID NO. 0434 
GAAGUUACCUCUUACAUUU; SEQ. ID NO. 0435 
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GUACGAAACUGUAUGAUUA; SEQ, ID NO. 0436 

GGACGAGGCUAUCAUUAUG) may contribute to treatment of this disorder. 

In another example, the von Hippel-Lindau (VHL) tumor suppressor has been 
5 observed to be inactivated at a high frequency in sporadic clear cell renal cell 
carcinoma (RCC) and RCCs associated with VHL disease. The VHL tumor 
suppressor targets hypoxia-inducible factor- 1 alpha (HIF-1 alpha), a transcription 
factor that can induce vascular endothelial growth factor (VEGF) expression, for 
ubiquitination and degradation. Inactivation of VHL can lead to increased levels of 

10 HIF-1 alpha, and subsequent VEGF over expression. Such over expression of VEGF 
has been used to explain the increased (and possibly necessary) vascularity observed 
in RCC. Thus, functional, highly functional, or hyperfiinctional siRNAs directed^ 
against either HIF-1 alpha (SEQ. ID NO. 0437 GAAGGAACCUGAUGCUUUA; 
SEQ. ID NO. 0438 GCAUAUAUCUAGAAGGUAU; SEQ. ID NO. 0439 

15 GAACAAAUACAUGGGAUUA; SEQ. ID NO. 0440 

GGACACAGAUUUAGACUUG) or VEGF (SEQ. ID NO. 0441 
GAACGUACUUGCAGAUGUG; SEQ. ID NO. 0442 
GAGAAAGCAUUUGUUUGUA; SEQ. ID NO. 0443 
GGAGAAAGCAUUUGUUUGU; SEQ. ID NO. 0444 

20 CGAGGCAGCUUGAGUUAAA) may be usefulJn the treatment of renal cell 
carcinoma. 

... ^ In another example, gene expression of platelet derived growth factor A and B 
(PDGF-A and PDGF-B) has been observed to be increased 22- and 6-fold, 

25 respectively, in renal tissues taken from patients with diabetic nephropathy as 

compared with controls. These findings suggest that over expression of PDGF A and 
B may play a role in the development of the progressive fibrosis that characterizes 
human diabetic kidney disease. Thus, functional, highly functional, or hyperfunctional 
siRNAs directed against either PDGF A 

30 (SEQ. ID NO. 0445: GGUAAGAUAUUGUGCUUUA; 
SEQ. ID NO. 0446: CCGCAAAUAUGCAGAAUUA; 
SEQ. ID NO. 0447: GGAUGUACAUGGCGUGUUA; 
SEQ. ID NO. 0448: GGUGAAGUUUGUAUGUUUA) or 
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PDGFB 

(SEQ. ID NO. 0449: CCGAGGAGCUUUAUGAGAU; 
SEQ. ID NO. 0450: GCUCCGCGCUUUCCGAUUU; 
SEQ. ID NO. 0451 GAGCAGGAAUGGUGAGAUG; 
5 SEQ. ID NO. 0452: GAACUUGGGAUAAGAGUGU; 
SEQ. ID NO. 0453 CCGAGGAGCUUUAUGAGAU; 

SEQ. ID NO. 0454 UUUAUGAGAUGCUGAGUGA) may be usefiil in the treatment 
of this form of kidney disorder. 

10 In another example, a strong correlation exists between the over-expression of 

glucose transporters (e.g. GLUT12) and cancer cells. It is predicted that cells 
undergoing uncontrolled cell growth up-regulate GLUT molecules so that they can " 
cope with the heightened energy needs associated with increased rates of proliferation 
and metastasis. Thus, siRNA-based therapies that target the molecules such as 

15 GLUTl (also known as SLC2A1 : siRNA= 

SEQ. ID NO.: 0455 GCAAUGAUGUCCAGAAGAA; 
SEQ. ID NO.: 0456 GAAGAAUAUUCAGGACUUA; 
SEQ. ID NO.: 0457 GAAGAGAGUCGGCAGAUGA; 
SEQ. ID NO.: 0458 CCAAGAGUGUGCUAAAGAA) 

20 

GLUT12 (also known as SLCA12: siRNA = 
SEQ. ID NO. 0459: GAGACACUCUGAAAUGAUA; 
SEQ. ID.NO. 0460: GAAAUGAUGUGGAUAAGAG; 
SEQ. ID NO. 0461: GAUCAAAUCCUCCCUGAAA; 
25 SEQ. ID NO. 0462: UGAAUGAGCUGAUGAUUGU) and other related transporters, 
may be of value in treating a multitude of malignancies. 

The siRNA sequences listed above are presented in a 5'^ 3' sense strand 
direction. In addition, siRNA directed against the targets listed above as well as those 
30 directed against other targets and listed in the accompanying compact disk may be 
useful as therapeutic agents. 

Example VIII. Evidence for the Benefits of Pooling 
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Evidence for the benefits of pooling have been demonstrated using the 
reporter gene, luciferase. Ninety siRNA duplexes were synthesized using Dharmacon 
proprietary ACE® chemistry against one of the standard reporter genes: firefly 
luciferase. The duplexes were designed to start two base pairs apart and to cover 
approximately 180 base pairs of the luciferase gene (see sequences in Table III). 
Subsequently, the siRNA duplexes were co-transfected with a luciferase expression 
reporter plasmid into HEK293 cells using standard transfection protocols and 
luciferase activity was assayed at 24 and 48 hours. 

Transfection of individual siRNAs showed standard distribution of inhibitory 
effect. Some duplexes were active, while others were not. Figure 15 represents a 
typical screen of ninety siRNA duplexes (SEQ. ID NO. 0032- 0120) positioned two 
base pairs apart. As the figure suggests, the functionality of the siRNA duplex is 
determined more by a particular sequence of the oligonucleotide than by the relative 
oligonucleotide position within a gene or excessively sensitive part of the mRNA, 
which is important for traditional anti-sense technology. 

When two continuous oligonucleotides were pooled together, a significant 
increase in gene silencing activity was observed. (See Figure 16) A gradual increase 
in efficacy and the firequency of pools fiinctionality was observed when the number of 
siRNAs increased to 3 and 4. (Figures 16, 17). Further, the relative positioning of 
the oligonucleotides within a pool did not determine whether a particular pool was 
functional (see Figure 18, in which 100% of pools of oligonucleotides distanced by 2, 
10 and 20 base pairs were functional). 

However, relative positioning may nonetheless have an impact. An increased 
functionality may exist when the siRNA are positioned continuously head to toe (5' 
end of one directly adjacent to the 3' end of the others). 

Additionally, siRNA pools that were tested performed at least as well as the 
best oligonucleotide in the pool, under the experimental conditions whose results are 
depicted in Figure 19. Moreover, when previously identified non-functional and 
marginally (semi) functional siRNA duplexes were pooled together in groups of five 
at a time, a significant functional cooperative action was observed. (See Figure 20) 



In fact, pools of semi-active oligonucleotides were 5 to 25 times more functional than 
the most potent oligonucleotide in the pool. Therefore, pooling several siRNA 
duplexes together does not interfere with the functionality of the most potent siRNAs 
within a pool, and pooling provides an unexpected significant increase in overall 
5 functionality 

Example DC. Pooling Across Species 

Experiments were performed on the following genes: |3-galactosidase, Renilla 
luciferase, and Secreted alkaline phosphatase, which demonstrates the benefits of 
10 pooling, (see Figure 21) Approximately 50% of individual siRNAs designed to 
silence the above-specified genes were functional, while 1 00% of the pools that 
contain the same siRNA duplexes were functional. 

Example X. Highly Functional siRNA 

15 Pools of five siRNAs in which each two siRNAs overlap to 10-90% resulted 

in 98% functional entities (>80% silencing). Pools of siRNAs distributed throughout 
the mRNA that were evenly spaced, covering an approximate 20 - 2000 base pair 
range, were also functional. When the pools of siRNA were positioned continuously 
head to tail relative to mRNA sequences and mimicked the natural products of Dicer 

20 cleaved long double stranded RNA, 98% of the pools evidenced highly functional 
activity (>95% silencing). 

Example-'Xr.'^Human cyclophyline 

Table III above lists the siRNA sequences for the human cyclophyline 
25 protein. A particularly functional siRNA may be selected by applying these 
sequences to any of Formula I to VII above. 

Alternatively, one could pool 2, 3, 4, 5 or more of these sequences to create a 
kit for silencing a gene. Preferably, within the kit there would be at least one 
30 sequence that has a relatively high predicted functionality when any of Formulas I - 
VII is applied. 

Example XIL Sample Pools of siELNAs and Their Application to Human Disease 
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The genetic basis behind human disease is well documented and siRNA may 
be used as both research or diagnostic tools and therapeutic agents, either individually 
or in pools. Genes involved in signal transduction, the immune response, apoptosis, 
DNA repair, cell cycle control, and a variety of other physiological functions have 
5 clinical relevance and therapeutic agents that can modulate expression of these genes 
may alleviate some or all of the associated symptoms. In some instances, these genes 
can be described as a member of a family or cIeiss of genes and siRNA (randomly, 
conventionally, or rationally designed) can be directed against one or multiple 
members of the family to induce a desired result. 

10 

To identify rationally designed siRNA to each gene, the sequence was 
analyzed using Formula VIII to identify a SMARTpool containing the functional 
sequences. To confirm the activity of these sequences, the siRNA are introduced into 
a cell type of choice (e,g, HeLa cells, HEK293 cells) and the levels of the appropriate 

15 message are analyzed using one of several art proven techniques. SiRNA having 

heightened levels of potency can be identified by testing each of the before mentioned 
duplexes at increasingly limiting concentrations. Similarly, siRNA having increased 
levels of longevity can be identified by introducing each duplex into cells and testing 
functionality at 24, 48, 72, 96, 120, 144, 168, and 192 hours after transfectiori. Agents 

20 that induce >95% silencing at sub-nanomolar concentrations anchor induce functional 
levels of silencing for >96 hours are considered hyperfiinctional. 

Thei foUamng are non-limiting examples of families of proteins to which 
siRNA described in this document are targeted against: 

25 

Transporters, Pumps, and Channels 

Transporters, pumps, and channels represent one class of genes that are 
30 attractive targets for siRNAs. One major class of transporter molecules are the ATP- 
binding cassette (ABC) transporters. To date, nearly 50 human ABC-transporter 
genes have been characterized and have been shown to be involved in a variety of 
physiological functions including transport of bile salts, nucleosides, chloride ions, 
cholesterol, toxins, and more. Predominant among this group are MDRl (which 
35 encodes the P-glycoprotein, NP_000918), the MDR-related proteins (MRPl-7), and 
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the breast cancer resistance protein (BCRP), In general, these transporters share a 
common structure, with each protein containing a pair of ATP -binding domains (also 
known as nucleotide binding folds, NBF) and two sets of transmembrane (TM) 
domains, each of which typically contains six membrane-spanning a-helices. The 
5 genes encoding this class of transporter are organized as either full transporters (Le. 
containing two TM and two NBF domains) or as half transporters that assemble as 
either homodimers or heterodimers to create functional transporters. As a whole, 
members of the family are widely dispersed throughout the genome and show a high 
degree of amino acid sequence identify among eukaryotes. 

10 

ABC-transporters have been implicated in several human diseases. For 
instance, molecular efflux pumps of this type play a major role in the development of 
drug resistance exhibited by a variety of cancers and pathogenic microorganisms. In 
the case of human cancers, increased expression of the MDRl gene and related pumps 

1 5 have been observed to generate drug resistance to a broad collection of commonly 

used chemotherapeutics including doxorubicin, daunorubicin, vinblastine, vincristine, 
colchicines. In addition to the contribution these transporters make to the 
development of multi-drug resistance, there are currently 13 human genetic diseases 
associated with defects in 14 different transporters. The most common of these 

20 conditions include cystic fibrosis, Stargardt disease, age-related macular degeneration, 
adrenoleukodystrophy, Tangier disease, Dubin- Johnson syndrome and progressive 
familial intrahepatic cholestasis. For this reason, siRNAs directed against members of 
this, and related;'rfamilie& are potentially valuable research and therapeutic tools. 

25 With respect to channels, analysis of Drosophila mutants has enabled the 

initial molecular isolation and characterization of several distinct channels including 
(but not limited to) potassium (K+) channels. This list includes shaker (Sh), which 
encodes a voltage activated K"*^ channel, slowpoke (Slo), a Ca^^ activated channel, 
and ether-a-go-go (Bag). The Eag family is further divided into three subfamilies: 

30 Eag, Elk (eag-like K channels), and Erg (Eag related genes). 

The Erg subfamily contains three separate family members (Ergl-3) that are 
distantly related to the sh family of voltage activated channels. Like sh, erg 
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polypetides contain the classic six membrane spanning architecture of K"^ channels 
(S1-S6) but differ in that each includes a segment associated with the C-terminal 
cytoplasmic region that is homologous to cyclic nucleotide binding domains (cNBD). 
Like many isolated ion channel mutants, erg mutants are temperature-sensitive 
5 paralytics, a phenotype caused by spontaneous repetitive firing (hyperactivity) in 
neurons and enhanced transmitter release at the neuromuscular jimction. 

Initial studies on the tissue distribution of all three members of the erg 
subfamily show two general patterns of expression. Ergl and erg3 are broadly 

10 expressed throughout the nervous system and are observed in the heart, the superior 
mesenteric ganglia, the celiac ganglia, the retina, and the brain. In contrast, erg2 
shows a much more restricted pattern of expression and is only observed in celiac 
ganglia and superior mesenteric ganglia. Similarly, the kinetic properties of the three 
erg potassium channels are not homogeneous. Ergl and erg2 channels are relatively 

1 5 slow activating delayed rectifiers whereas the erg3 current activates rapidly and then 
exhibits a predominantly transient component that decays to a sustained plateau. The 
current properties of all three channels are sensitive to methanesulfonanilides, 
suggesting a high degree of conservation in the pore structure of all three proteins. 

20 Recently, the erg family of channels has been implicated in human disease. 

Consistent with the observation that ergl is expressed in the heart, single strand 
conformation polymorphism and DNA sequence analyses have identified HERG 
(human ergl) mutatioiis?iji?six long-QT-syndrome (LQT) families, an inherited 
disorder that results in sudden death fi-om a ventricular tachyarrythmia. Thus siRNA 

25 directed against this group of molecules (e.g. KCNHl-8) will be of extreme 
therapeutic value. 

Another group of channels that are potential targets of siRNAs are 
the CLCA family that mediate a Ca^'^-activated CP conductance in a variety of 
30 tissues. To date, two bovine (bCLCl ; bCLCA2 (Lu-ECAM-1)), three mouse 

(mCLCAl; mCLCA2; mCLCA3) and four human (hCLCAl; hCLCA2; hCLCA3; 
hCLCA4) CLCA family members have been isolated and patch-clamp studies with 
transfected human embryonic kidney (HEK-293) cells have shown that bCLCAl, 
mCLCAl, and hCLCAl mediate a Ca^^-activated Cl~ conductance that can be 
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inhibited by the anion channel blocker DIDS and the reducing agent dithiothreitol 
(DTT). 



The protein size, structure, and processing seem to be similar among different 
5 CLCA family members and has been studied in greatest detail for Lu-ECAM-1 . The 
Lu-ECAM-1 open reading frame encodes a precursor glycoprotein of 130 kDa that is 
processed to a 90-kDa amino-terminal cleavage product and a group of 30- to 40-kDa 
glycoproteins that are gjycosylation variants of a single polypeptide derived from its 
carboxy terminus. Both subunits are associated with the outer cell surface, but only 
10 the 90-kDa subunit is thought to be anchored to the cell membrane via four 
transmembrane domains. 

Although the protein processing and function appear to be conserved among 
CLCA homologs, significant differences exist in their tissue expression pattems. For 

15 example, bovine Lu-ECAM-1 is expressed primarily in vascular endothelia, bCLCAl 
is exclusively detected in the trachea, and hCLCAl is selectively expressed in a 
subset of human intestinal epithelial cells. Thus the emerging picture is that of a 
multigene family with members that are highly tissue specific, similar to the CIC 
family of voltage-gated Cl~ channels. The human channel, hCLCA2, is particular 

20 interesting from a medical and pharmacological standpoint. CLCA2 is expressed on 
the luminal surface of lung vascular endothelia and serves as an adhesion molecule 
for lung metastatic cancer cells, thus mediating vascular arrest and lung colonization. 
Expression of this moleculeti^fenprmal mammary epithelium is consistently lost in 
human breast cancer and in nearly all tumorigenic breast cancer cell lines. Moreover, 

25 re-expression of hCLCA2 in human breast cancer cells abrogates tumorigenicity in 
nude mice, implying that hCLCA2 acts as a tumour suppressor in breast cancer. For 
these reasons, siRNA directed against CLCA family members and related channels 
may prove to be valuable in research and therapeutic venues. 

30 Transporters Involved in Synaptic Transmission. 

Synaptic transmission involves the release of a neurotransmitter into the 
synaptic cleft, interaction of that transmitter with a postsynaptic receptor, and 
subsequent removal of the transmitter from the cleft. In most synapses the signal is 
terminated by a rapid reaccumulation of the neurotransmitter into presynaptic 
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terminals. This process is catalyzed by specific neurotransmitter tremsporters that are 
often energized by the electrochemical gradient of sodium across the plasma 
membrane of the presynaptic cells. 



5 Aminobutyric acid (GABA) is the major inhibitory neurotransmitter in the 

central nervous system. The inhibitory action of GABA, mediated through GABAa/ 
GABAb receptors, and is regulated by GABA transporters (GATs), integral 
membrane proteins located perisynaptically on neurons and glia. So far four different 
carriers (GAT1-GAT4) have been cloned and their cellular distribution has been 

10 partly worked out. Comparative sequence analysis has revealed that GABA 

transporters are related to several other proteins involved in neurotransmitter uptake 
including gamma-aminobutyric acid transporters, monoamine transporters, amino 
acid transporters, certain "orphan" transporters, and the recently discovered bacterial 
transporters. Each of these proteins has a similar 12 transmembrane helices topology 

15 and relies upon the Na+/Cl- gradient for transport function. Transport rates are 

dependent on substrate concentrations, with half-maximal effective concentrations for 
transport frequently occurring in the submicromolar to low micromolar range. In 
addition, transporter function is bidirectional, and non- vesicular efflux of transmitter 
may contribute to ambient extracellular transmitter levels. 

20 

Recent evidence suggests that GABA transporters, and neurotransmitter 
transporters in general, are not passive players in regulating neuronal signaling; rather, 
transporter function can bealtereffcby a variety of initiating factors and signal 
transduction cascades. In general, this functional regulation occurs in two ways, 

25 either by changing the rate of transmitter flux through the transporter or by changing 
the number of functional transporters on the plasma membrane. A recurring theme in 
transporter regulation is the rapid redistribution of the transporter protein between 
intracellular locations and the cell surface. In general, this functional modulation 
occurs in part through activation of second messengers such as kinases, phosphatases, 

30 arachidonic acid, and pH. However, the mechanisms xmderlying transporter 
phosphorylation and transporter redistribution have yet to be fully elucidated. 

GABA transporters play a pathophysiological role in a number of human 
diseases including temporal lobe epilepsy and are the targets of pharmacological 
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interventions. Studies in seizure sensitive animals show some (but not all) of the GAT 
transporters have altered levels of expression at times prior to and post seizure, 
suggesting this class of transporter may affect epileptogenesis, and that alterations 
following seizure may be compensatory responses to modulate seizure activity. For 
5 these reasons, siRNAs directed against members of this family of genes (including but 
not limited to SLCG6A1-12) may prove to be valuable research and therapeutic tools. 

Organic Ion Transporters. 

The human body is continuously exposed to a great variety of xenobiotics, via 
10 food, drugs, occupation, and environment. Excretory organs such as kidney, liver, and 
intestine defend the body against the potentially harmful effects of these compounds 
by transforming them into less active metabolites that are subsequently secreted from 
the system. 

15 Carrier-mediated transport of xenobiotics and their metabolites exist for the 

active secretion of organic anions and cations. Both systems are characterized by a 
high clearance capacity and tremendous diversity of substances accepted, properties 
that result from the existance of multiple transporters with overlapping substrate 
specificities. The class of organic anion transporters plays a critical role in the 

-20 elimination of a large number of drugs (eg,, antibiotics, chemotherapeutics, diuretics, 
nonsteroidal anti-inflammatory drugs, radiocontrast agents, cytostatics); drug 
metabolites (especially conjugation products with glutathione, glucuronide, glycine, 
sulfate, acetate); and toxicantStand#idMnetabolites (e.g., mycotoxins, herbicides, 
plasticizers, glutathione ^-conjugates of polyhaloalkanes, polyhaloalkenes, 

25 hydroquinones, aminophenols), many of which are specifically harmful to the kidney. 

Over the past couple of years the number of identified anion transporting 
molecules has grown tremendously. Uptake of organic Einions (OA~) across the 
basolateral membrane is mediated by the classic sodium-dependent organic anion 
30 transport system, which includes a-ketoglutarate (a-KG^~)/OA~ exchange via the 
organic anion transporter (OATl) and sodium-ketoglutarate cotransport via the 
NaVdicarboxylate cotransporter (SDCT2). The organic anion transporting polypetide, 
Oatpl, and the kidney-specific OAT-Kl and OAT-K2 are seen as potential molecules 
that mediate facilitated OA" efflux but could also be involved in reabsorption via an 
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exchange mechanism. Lastly the PEPTl and PEPT2 mediate luminal uptake of 
peptide drugs, whereas CNTl and CNT2 are involved in reabsorption of nucleosides 

The organic anion-transporting polypeptide 1 (Oatpl) is a Na^- and ATP- 
5 independent transporter originally cloned from rat liver. The tissue distribution and 
transport properties of the Oatpl gene product are complex. Oatpl is localized to the 
basolateral membrane of hepatocytes, and is found on the apical membrane of S3 
proximal tubules. Studies with transiently transfected cells (e.g, HeLa cells) have 
indicated that Oatpl mediates transport of a variety of molecules including 
10 taurocholate, estrone-3-sulfate, aldosterone, Cortisol, and others. The observed uptake 
of taurocholate by Oatpl expressed in X laevis oocytes is accompanied by efflux of 
GSH, suggesting that transport by this molecule may be glutathione dependent. 

Computer modeling suggests that members of the Oatp family are highly 
15 conserved, hydrophobic, and have 12 transmembrane domains. Decreases in 
expression of Oatp family members have been associated with cholestatic liver 
diseases and human hepatoblastomas, making this family of proteins of key interest to 
researchers and the medical community. For these reasons, siRNAs directed against 
OAT family members (including but not limited to SLC21 A2, 3, 6, 8, 9, 1 1, 12, 14^ 
20, 15, and related transporters) are potentially useful as research and therapeutic tools. 

Nucleoside transporters. 

Nucleoside transporters play^kej^roles in physiology and pharmacology. 
Uptake of exogenous nucleosides is a critical first step of nucleotide synthesis in 

25 tissues such as bone marrow and intestinal epithelium and certain parasitic organisms 
that lack de novo pathways for purine biosynthesis. Nucleoside transporters also 
control the extracellular concentration of adenosine in the vicinity of its cell surface 
receptors and regulate processes such as neurotransmission and cardiovascular 
activity. Adenosine itself is used clinically to treat cardiac arrhythmias, and nucleoside 

30 transport inhibitors such as dipyridamole, dileizep, and draflazine function as coronary 
vasodilators. 

In mammals, plasma membrane transport of nucleosides is brought about by 
members of the concentrative, Na^-dependent (CNT) and equilibrative, Na^- 
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independent (ENT) nucleoside transporter families. CNTs are expressed in a tissue- 
specific fashion; ENTs are present inmost, possibly all, cell types and are responsible 
for the movement of hydrophilic nucleosides and nucleoside analogs down their 
concentration gradients. In addition, structure/function studies of ENT family 
5 members have predicted these molecules to contain eleven transmembrane helical 
segments with an amino terminus that is intracellular and a carboxyl terminus that is 
extracellular. The proteins have a large glycosylated loop between TMs 1 and 2 and 
a large cytoplasmic loop between TMs 6 and 7. Recent investigations have implicated 
the TM 3-6 region as playing a central role in solute recognition. The medical 

10 importance of the ENT family of proteins is broad. In humans adenosine exerts a 

range of cardioprotective effects and inhibitors of ENTs are seen as being valuable in 
alleviating a variety of cardio/cardiovascular ailments. In addition, responses to 
nucleoside analog drugs has been observed to vary considerably amongst e.g. cancer 
patients. While some forms of drug resistance have been shown to be tied to the up- 

15 regulation of ABC-transporters (e.g. MDRl), resistance may also be the result of 

reduced drug uptake (i.e. reduced ENT expression). Thus, a clearer understanding of 
ENT transporters may aid in optimizing drug treatments for patients suffering a wide 
range of malignancies. For these reasons, siRNAs directed against this class of 
molecules (including SLC28A1-3, SLC29A1-4, and related molecules) may be useful 

20 «as therapeutic and research tools. 

Sulfate Transporters. 

All cells require inorganic sulfate^ar^pmial function. Sulfate is the fourth 
most abundant anion in human plasma and is the major source of sulfur in many 
25 organisms. Sulfation of extracellular matrix proteins is critical for maintaining 
normal cartilage metabolism and sulfate is an important constituent of myelin 
membranes found in the brain 

Because sulfate is a hydrophilic anion that cannot passively cross the lipid 
30 bilayer of cell membranes, all cells require a mechanism for sulfate influx and efflux 
to ensure an optimal supply. To date, a variety of sulfate transporters have been 
identified in tissues from many origins. These include the renal sulfate transporters 
(NaSi-1 and Sat-1), the ubiquitously expressed diastrophic dysplasia sulfate 
transporter (DTDST), the intestinal sulfate transporter (DRA), and the erythrocyte 
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anion exchanger (AEl). Most, if not all, of these molecules contain the classic 12 
transmembrane spanning domain architecture commonly found amongst members of 
the anion transporter superfamily. 



5 Recently three different sulfate transporters have been associated with specific 

human genetic diseases. Family members SLC26A2, SLC26A3, and SLC26A4 have 
been recognized as the disease genes mutated in diastrophic dysplasia, congenital 
chloride diarrhea (CLD), and Pendred syndrome (PDS), respectively. DTDST is a 
particularly complex disorder. The gene encoding this molecule maps to chromosome 

10 5q, and encodes two distinct transcripts due to altemative exon usage. In contrast to 
other sulfate transporters (e.g, Sat-1) anion movement by the DTDST protein is 
markedly inhibited by either extracellular chloride or bicarbonate. Impaired function 
of the DTDST gene product leads to undersulfation of proteoglycans and a complex 
family of recessively inherited osteochondrodysplasias (achondrogenesis type IB, 

1 5 atelosteogenesis type II, and diastrophic dysplasia) with clinical features includinjg but 
not limited to, dwarfism, spinal deformation, and specific joint abnormalities. 
Interestingly, while epidemiological studies have shown that the disease occurs in 
most populations, it is particularly prevalent in Finland owing to an apparent foimder 
effect. For these reasons, siRNAs directed against this class of genes (including but 

20 not limited to SLC26A1-9, and related molecules) may be potentially helpful in both 
therapeutic and research venues. 

Ion Exchangers -^-te==±r^ssc«^ 

Intracellular pH regulatory mechanisms are critical for the maintenance of 
25 countless cellular processes. For instance, in muscle cells, contractile processes and 
metabolic reactions are influenced by pH. During periods of increased energy 
demands and ischemia, muscle cells produce large amounts of lactic acid that, without 
quick and efficient disposal, would lead to acidification of the sarcoplasm. 

30 Several different transport mechemisms have evolved to maintain a relatively 

constant intracellular pH. The relative contribution of each of these processes varies 
with cell type, the metabolic requirements of the cell, and the local environmental 
conditions. Intracellular pH regulatory processes that have been characterized 
functionally include but are not limited to the Na^/H^ exchange, the Na(HC03)„ 
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cotransport, and the Na^-dependent and -independent Cl~/base exchangers. As 
bicarbonate and CO2 comprise the major pH buffer of biological fluids, sodium 
biocarbonate cotransporters (NBCs) are critical. Studies have shown that these 
molecules exist in numerous tissues including the kidney, brain, liver, comea, heart, 
5 and lung, suggesting that NBCs play an important role in mediating HCOa" transport 
in both epithelial as well as nonepithelial cells. 

Recent molecular cloning experiments have identified the existence of four 
NBC isoforms (NBCl, 2, 3 and 4) and two NBC-related proteins, AE4 and NCBE 

10 (Anion Exchanger 4 and Na-dependent Chloride-Bicarbonate Exchanger). The 

secondary structure analyses and hydropathy profile of this family predict them to be 
intrinsic membrane proteins with 12 putative treinsmembrane domains and several 
family members exhibit A^-linked glycosylation sites, protein kinases A and C, casein 
kinase II, and ATP/GTP-binding consensus phosphorylation sites, as well as potential 

1 5 sites for myristylation and amidation. AE4 is a relatively recent addition to this 
family of proteins and shows between 30-48% homology with the other family 
members. When expressed in COS-7 cells and Xenopus oocytes AE4 exhibits sodium- 
independent and DIDS-insensitive anion exchanger activity. Exchangers have been 
shown to be responsible for a variety of human diseases. For instance, mutations in 

20 three genes of the anion transporter family (SLC) are believed to cause known 
hereditary diseases, including chondrodysplasia (SLC26A2, DTD), diarrhea (A3, 
down-regulated in adenoma/chloride-losing diarrhea protein: DRA/CLD), and 
goiter/deafiiess syndrome (A4, pendrin). Moreo.ver^mutations in NaH-/HC03 co- 
transporters have also been associated with various human maladies. For these 

25 reasons, siRNAs directed against these sorts of genes (e.g. SLC4A4-10, and related 
genes) may be usefiil for therapeutic and research purposes. 

Receptors Involved in Synaptic Transmission 

In all vertebrates, fast inhibitory synaptic transmission is the result of the 
30 interaction between the neurotransmitters* glycine (Gly) and y-aminobutyric acid 

(GAB A) and their respective receptors. The strychnine-sensitive glycine receptor is 
especially important in that it acts in the maiimialian spinal cord and brain stem and 
has a well-established role in the regulation of locomotor behavior. 
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Glycine receptors display significant sequence homology to several other 
receptors including the nicotinic acetylcholine receptor, the aminobutyric acid 
receptor type A (GABAaR), and the serotonin receptor type 3 (5-HT3R) subunits. As 

5 members of the superfamily of ligand-gated ion channels, these polypeptides share 
common topological features. The glycine receptor is composed of two types of 
glycosylated integral membrane proteins (al-a4 and (3) arranged in a pentameric 
suprastmcture. The alpha subunit encodes a large extracellular, N-terminal domain 
that carries the structural determinants essential for agonist and antagonist binding, 

1 0 followed by four transmembrane spanning regions (TMl -TM4), with TM2 playing the 
critical role of forming the inner wall of the chloride channel. 

The density, location, and subunit composition of glycine neurotransmitter 
receptors changes over the course of development. It has been observed that the 

1 5 amount of GlyR gene translation (assessed by the injection of developing rat cerebral 
cortex mRNA into Xenopus oocytes) decreases with age, whereas that of GAB ARs 
increases. In addition, the type and location of mRNAs coding for GlyR changes over 
the course of development. For instance in a study of the expression of alpha 1 and 
alpha 2 subunits in the rat, it was observed that (in embryonic periods El 1-18) the 

20 mantle zone was scarce in the alpha 1 mRNA, but the germinal zone (matrix layer) at 
El 1-14 expressed higher levels of the message. At postnatal day 0 (PO), the alpha 1 
signals became manifested throughout the gray matter of the spinal cord. By contrast, 
the spinal tissues at PO exhibited the highest levels*ofralpha-2 mRNA, which 
decreased with the postnatal development. 

25 

In both, man and mouse mutant lines, mutations of GlyR subunit genes result 
in hereditary motor disorders characterized by exaggerated startle responses and 
increased muscle tone. Pathological alleles of the Glral gene are associated with the 
murine phenotypes oscillator (sp(f^) and spasmodic (spd). Similarly, a mutant allele 
30 of Glrb has been found to underly the molecular pathology of the spastic mouse (spa). 
Resembling the situation in the mouse, a variety of GLRAl mutant alleles have been 
shown to be associated with the human neurological disorder hyperekplexia or startle 
disease. For these reasons, siRNA directed against glycine receptors (GLRAl -3, 



90 



GLRB, and related molecules), glutamate receptors, GABA receptors, ATP receptors, 
and related neurotransmitter receptor molecules may be valuable therapeutic and 
research reagents. 



5 Proteases 

Kallikreins 

One important class of proteases are the kallikreins, serine endopeptidases that 
split peptide substrates preferentially on the C-terminal side of internal arginyl and 

10 lysyl residues. Kallikreins are generally divided into two distinct groups, plasma 
kallikreins and tissue kallikreins. Tissue kallikreins represent a large group of 
enzymes that have substantial similarities at both the gene and protein level. The 
genes encoding this group are frequently found on a single chromosome, are 
organized in clusters, and are expressed in a broad range of tissues (e,g. pancreas, 

1 5 ovaries, breast). In contrast, the plasma form of the enzyme is encoded by a single 
gene (e.g. KLK3) that has been localized to chromosome 4q34-35 in humans. The 
gene encoding plasma kallikrein is expressed solely in the liver, contains 1 5 exons, 
and encodes a glycoprotein that is translated as a preprotein called prekallikrein. 

20 Kallikreins are believed to play an important role in a host of physiological 

events For instance, the immediate consequence of plasma prekallikrein activation is 
the cleavage of high molecular weight kininogen (HK) and the subsequent liberation 
of bradykinin, a nine amino acid vasoactive peptide that is an important mediator of 
inflammatory responses. Similarly, plasma kallikrein promotes single-chain urokinase 

25 activation and subsequent plasminogen activation, events that are critical to blood 
coaggulation and wound healing. 



Disruptions in the function of kallikreins have been implicated in a variety of 
pathological processes including imbalances in renal function and inflammatory 
30 processes. For these reasons, siRNAs directed against this class of genes (e.g. KLKl- 
1 5) may prove valuable in both research and therapeutic settings. 
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ADAM Proteins 

The process of fertilization takes place in a series of discrete steps whereby the 
sperm interacts with, i) the cumulus cells and the hyaluronic acid extracellular matrix 
(ECM) in which they are embedded, ii) the egg's own ECM, called the zona pellucida 
5 (ZP), and iii) the egg plasma membrane. During the course of these interactions, the 
"acrosome reaction," the exocytosis of the acrosome vesicle on the head of the sperm, 
is induced, allowing the sperm to penetrate the ZP and gain access to the perivitelline 
space. This process exposes new portions of the sperm membrane, including the inner 
acrosomal membrane and the equatorial segment, regions of the sperm head that can 
10 participate in initial gamete membrane binding. 

The interactions of the gamete plasma membranes appear to involve multiple 
ligands and receptors and are frequently compared to leukocyte-endothelial 
interactions. These interactions lead to a series of signal transduction events in the 

15 egg, known as collectively as egg activation and include the initiation of oscillations 
in intracellular calcium concentration, the exit from meiosis, the entry into the first 
embryonic mitosis, and the formation of a block to polyspermy via the release of ZP- 
modifying enzymes from the egg's cortical granules. Ultimately, sperm and egg not 
only adhere to each other but also go on to undergo membrane fiision, making one 

20 cell (the zygote) from two. 

Studies on the process of sperm-egg interactions have identified a number of 
proteins that are crucial for fertilization. One class of proteins,: Galled the ADAM 
family (A Disintegrin And Metalloprotease), has been found to be important in 

25 spermatogenesis and fertilization, as well as various developmental systems including 
myogenesis and neurogenesis. Members of the family contain a disintegrin and 
metalloprotease domain (and therefore have (potentially) both cell adhesion and 
protease activities), as well as cysteine-rich regions, epidermal growth factor (EGF)- 
like domains, a transmembrane region, and a cytoplasmic tail. Currently, the ADAM 

30 gene family has 29 members and constituents are widely distributed in many tissues 
including the brain, testis, epididymis, ovary, breast, placenta, liver, heart, lung, bone, 
and muscle. 
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One of the best-studied members of the ADAM family is fertiUn, a 
heterodimeric protein comprised of at least two subunits, fertilin alpha and fertilin 
beta. The fertilin beta gene (ADAM2) has been disrupted with a targeting gene 
construct corresponding to the exon encoding the fertilin beta disintegrin domain. 
5 Sperm from males homozygous for disruptions in this region exhibit defects in 

multiple facets of sperm function including reduced levels of sperm transit from the 
uterus to the oviduct, reduced sperm-ZP binding, and reduced sperm-egg binding, all 
of which contribute to male infertility. 

10 Recently, four new ADAM family members (ADAM 24-27) have been 

isolated. The deduced amino acid sequences show that all four contain the complete 
domain organization common to ADAM family members and Northern Blot analysis 
has shown all four to be specific to the testes. SiRNAs directed against this class of 
genes (e.g. ADAM2 and related proteins) may be useful as research tools and 

15 therapeutics directed toward fertility and birth control. 

« 

Aminopeptidases 

Aminopeptidases are proteases that play critical roles in processes such as 
protein maturation, protein digestion in its terminal stage, regulation of hormone 

20 levels, selective or honieostatic protein tumover, and plasmid stabilization. These 

enzymes generally have broad substrate specificity, occur in several forms and play a 
major role in physiological homeostasis. For instance, the effects of bradykinin, 
angiotensin converting enzyme (ACE), and other vasoactive molecxiles^are muted by 
one of several peptidases that cleave the molecule at an intemal position and eliminate 

25 its ability to bind its cognate receptor (e.g. for bradykinin, the B2-receptor). 

Among the enzymes that can cleave bradykinin is the membrane bound 
aminopeptidase P, also referred to as aminoacylproline aminopeptidase, proline 
aminopeptidase; X-Pro aminopeptidase (eukaryote) and XPNPEP2. Aminopeptidase 
30 P is an aminoacylproline aminopeptidase specific for NH2-terminal Xaa-proline 

bonds. The enzyme i) is a mono-zinc-containing molecule that lacks any of the typical 
metal binding motifs found in other zinc metalloproteases, ii) has an active-site 
configuration similar to that of other members of the MG peptidase family, and iii) is 
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present in a variety of tissues including but not limited to the lung, kidney, brain, and 
intestine. 

Aminopeptidases play an important role in a diverse set of human diseases. 
5 Low plasma concentrations of aminopeptidase P are a potential predisposing factor 
for development of an^o-oedema in patients treated with ACE inhibitors, and 
inhibitors of aminopeptidase P may act as cardioprotectors against other forms of 
illness including, but not limited to myocardial infarction. For these reasons, siRNAs 
directed against this family of proteins (including but not limited to XPNPEPl and 
10 related proteins) may be useful as research and therapeutic tools. 

Serine Proteases 

One important class of proteases are the serine proteases. Serine proteases 
share a common catalytic triad of three amino acids in their active site (serine 

15 (nucleophile), aspartate (electrophile), and histidine (base)) and can hydrolyze either 
esters or peptide bonds utilizing mechanisms of covalent catalysis and preferential 
binding of the transition state. Based on the position of their introns serine proteases 
have been classified into a minimum of four groups including those in which 1) the 
gene has no introns interrupting the exon coding for the catalytic triad (e.g. the 

20 haptoglobin gene,); 2) eachzgene contains an intron just downstream from the codon 
for the histidine residue at the active site, a second intron downstream from the exon 
containing the aspartic acid residue of the active site and a third intron just upstream 
from the exon containing the serine of the active site {e.g. trypsinogen-^^.,.^^. 
chymotrypsinogen, kallikrein and proelastase); 3) the genes contain seven introns 

25 interrupting the exons coding the catalytic region (e,g, complement factor B gene); 
and 4) the genes contain two introns resulting in a large exon that contains both the 
active site aspartatic acid and serine residues (e.g. factor X, factor IX and protein C 
genes). 

30 Cytotoxic lymphocytes (e.g. CD8(+) cytotoxic T cells and natural killer cells) 

form the major defense of higher organisms against vims-infected and transformed 
cells. A key function of these cells is to detect and eliminate potentially harmful cells 
by inducing them to undergo apoptosis. This is achieved through two principal 
pathways, both of which require direct but transient contact between the killer cell and 
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its target. The first pathway involves ligation of TNF receptor-like molecules such as 
Fas/CD95 to their cognate ligands, and results in mobilization of conventional, 
programmed cell-death pathways centered on activation of pro-apoptotic caspases. 
The second mechanism consists of a pathway whereby the toxic contents of a 
5 specialized class of secretory vesicles are introduced into the target cell. Studies over 
the last two decades have identified the toxic components as Granzymes, a family of 
serine proteases that are expressed exclusively by cytotoxic T lymphocytes and 
natural killer (NK) cells. These agents are stored in specialized lytic granules and 
enter the target cell via endocytosis. Like caspases, cysteine proteases that play an 
1 0 important role in apoptosis, granzymes can cleave proteins after acidic residues, 
especially aspartic acid, and induce apoptosis in the recipient cell. 

Granzymes have been grouped into three subfamilies according to substrate 
specificity. Members of the granzyme family that have enzymatic activity similar to 

15 the serine protease chymotrypsin are encoded by a gene cluster termed the 'chymase 
locus'. Similarly, granzymes with trypsin-like specificities are encoded by the 
*tryptase locus', and a third subfamily cleaves after unbranched hydrophobic residues, 
especially methionine, and are encoded by the 'Met-ase locus*. All granzymes are 
synthesized as zymogens and, after clipping of the leader peptide, obtain maximal 

20 enzymatic activity subsequent Jto-the removal of an amino-terminal dipeptide. 

Granzymes have been found to be important in a number of important 
biological fianctions including defense against intracellular pathogens,-^^aft?versus 
host reactions, the susceptibility to transplantable and spontaneous malignancies, 
25 lymphoid homeostasis, and the tendency toward auto-inmiune diseases. For these 
reasons, siRNAs directed against granszymes {e.g. GZMA, GZMB, GZMH, GZHK, 
GZMM) and related serine proteases may be usefiil research and therapeutic reagents. 

Kinases 

30 Protein Kinases (PKs) have been implicated in a mmiber of biological 

processes. Kinase molecules play a central role in modulating cellular physiology and 
developmental decisions, and have been implicated in a large list of human maladies 
including cancer, diabetes, and others. 
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During the coxirse of the last three decades, over a hundred distinct protein 
kinases have been identified, all with presumed specific cellular fiinctions. A few of 
these enzymes have been isolated to sufficient purity to perform in vitro studies, but 
most remain intractable due to the low abundance of these molecules in the cell. To 
5 counter this technical difficulty, a nximber of protein kinases have been isolated by 
molecular cloning strategies that utilize the conserved sequences of the catalytic 
domain to isolate closely related homologs. Altematively, some kinases have been 
purified (and subsequently studied) based on their interactions with other molecules. 



10 p58 is a member of the p34cdc2 -related supergene family and contains a large 

domain that is highly homologous to the cell division control kinase, cdc2. This new 
cell division control-related protein kinase was originally identified as a component of 
semipurified galactosyltransferase; thus, it has been denoted galactosyltransferase- 
associated protein kinase (GTA-kinase). GTA-kinase has been found to be expressed 

15 in both adult and embryonic tissues and is known to phosphorylate a number of 

substrates, including histone HI , and casein. Interestingly enough, over expression of 
this molecule in CHO cells has shown that elevated levels of p58 result in a prolonged 
late telophase and an early Gl phase, thus hinting of an important role for GTA- 
kinase in cell cycle regulation. 

20 

Cvclin Dependent Kinases 

The cyclin-dependent kinases (Cdks) are a family of highly conserved 
*>o=i2=:a.\^-B... serine/threonine kinases that mediate many of the cell cycle transitions that:^OGCur 

during duplication. Each of these Cdk catalytic subunits associates with a specific 
25 subset of regulatory subunits, termed cyclins, to produce a distinct Cdk-cyclin kinase 
complex that, in general, fianctions to execute a unique cell cycle event. 



Activation of the Cdk cyclin kinases during cellular transitions is controlled by 
a variety of regulatory mechanisms. For the Cdc2 cyclin B complex, inhibition of 
30 kinase activity during S phase and G2 is accomplished by phosphorylation of two 
Cdc2 residues, Thr*^ and Tyr'^, which are positioned within the ATP-binding cleft. 
Phosphorylation of Thr^"^ and/or Tyr^^ suppresses the catalytic activity of the molecule 
by disrupting the orientation of the ATP present within this cleft. In contrast, the 
abrupt dephosphorylation of these residues by the Cdc25 phosphatase results in the 
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rapid activation of Cdc2 cyclin B kinase activity and subsequent downstream mitotic 
events. While the exact details of this pathway have yet to be elucidated, it has been 
proposed that Thr^'^/Tyr^^ phosphorylation functions to permit a cell to attain a critical 
concentration of inactive Cdk cyclin complexes, which, upon activation, induces a 
5 rapid and complete cell cycle transition. Furthermore, there is evidence in 
manunalian cells that Thr^'^/Tyr*^ phosphorylation also functions to delay Cdk 
activation after DNA damage. 

The Schizosaccharomyces pombe weel gene product was the first kinase 

10 identified that is capable of phosphorylating Tyr^^ in Cdc2. Homologs of the Weel 

kinase have been subsequently identified and biochemically characterized fi-om a wide 
range of species including human, mouse, frog, Saccharomyces cerevisiae, and 
Drosophila. In vertebrate systems, where Thr*"* in Cdc2 is also phosphorylated, the 
Weel kinase was capable of phosphorylating Cdc2 on Tyr^^, but not Thr^"^, indicating 

15 that another kinase was responsible for Thr*"^ phosphorylation. This gene, Mytl 

kinase, was recently isolated from the membrane fractions of Xenopus egg extracts 
and has been shown to be capable of phosphorylating Thr^'* and, to a lessor extent, 
Tyr^^ in Cdc2. A human Mytl homolog displaying similar properties has been 
isolated, as well as a non-membrane-associated molecule with Thr^"* kinase activity. 

20 .4:^ 

In the past decade it has been shown that cancer can originate from 
overexpression of positive regulators, such as cyclins, or from underexpression of 
j.-^h=s^.s«itt->. negative regulators {e,g, pi 6 (INK4a), pi 5 (INK4b), p21 (Cipl)). Inhibitors such^as-^... 

Mytl are the focus of much cancer research because they are capable of controlling 

25 cell cycle proliferation, now considered the Holy Grail for cancer treatment. For these 
reasons, siRNA directed against kinases and kinase inhibitors including but not 
limited to ABLl, ABL2, ACKl, ALK, AXL, BLK, BMX, BTK, C20orf64, CSFIR, 
SCK, DDRl, DDR2, DKFZp761P1010, EGFR, EPHAl, EPHA2, EPHA3, EPHA4, 
EPHA7, EPHA8, EPHBl, EPHB2, EPHB3, EPHB4. EPHB6, ERBB2, ERBB3, 

30 ERBB4, FER, FES, FGFRl, FGFR2, FGFR3, FGFR4, FGR, FLTl, FLT3, FLT4, 

FRK, FYN, HCK, IGFIR, INSR, ITK, JAKl, JAK2, JAK3, KDR, KIAA1079, KIT, 
LCK, LTK, LYN, MATK, MERTK, MET, MSTIR, MUSK, NTRKl, NTRK2, 
NTRK3, PDGFRA, PDGFRB, PTK2, PTK2B, PTK6, PTK7, PTK9, PTK9L, RET, 
RORl, ROR2, ROSl, RYK, SRC, SYK, TEC, TEK, TIE, TNKl, TXK, TYK2, 



97 



TYROS, YESl, and related proteins, may be useful for research and therapeutic 
purposes. 

G Protein Coupled Receptors 

One important class of genes to which siRNAs c£m be directed are G-protein 
coupled receptors (GPCRs). GPCRs constitute a superfamily of seven transmembrane 
spanning proteins that respond to a diverse array of sensory and chemical stimuli, 
such as light, odor, taste, pheromones, hormones and neurotransmitters. GPCRs play a 
central role in cell proliferation, differentiation, and have been implicated in the 
etiology of disease. 

The mechanism by which G protein-coupled receptors translate extracellular 
signals into cellular changes was initially envisioned as a simple linear model: 
activation of the receptor by agonist binding leads to dissociation of the heterotrimeric 
GTP-binding G protein (Gs, Gi, or Gq) into its alpha and beta/gamma subiuiits, both 
of which can activate or inhibit various downstream effector molecules. More 
specifically, activation of the GPCR induces a conformational change in the Ga 
subunit, causing GDP to be released and GTP to be bound in its place. The Ga and 
Gpy subunits then dissociate from the receptor and interact with a variety of effector 
molecules. For instance in the case of the Gs family, the primary function is to 
stimulate the intracellular messenger adenylate cyclase (AC), which catalyzes the 
conversion of cytoplasmic ATP into tiie secondary messenger cyclic AMP (cAMP). 
In contrast, the Gi family inhibits this pathway and the Gq family activates 
phospholipases C (PLC), which cleaves phosphatidylinositol 4,5, bisphosphate (PIP2) 
to generate inositol- 1,4,5-phosphate (IP3) and diacylglycerol (DAG). 

More recently, studies have shown that the functions of GPCRs are not limited 
to their actions on G-proteins and that considerable cross-talk exists between this 
diverse group of receptor molecules and a second class of membrane bound proteins, 
the receptor tyrosine kinases (RTKs). A number of GPCRs such as endothelin-1 , 
thrombin, bombesin, and dopamine receptors can activate MAPKs, a downstream 
effector of the RTK/Ras pathway. Interestingly, the interaction between these two 
families is not unidirectional and RTKs can also modulate the activity of signaling 
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pathways traditionally thought to be controlled exclusively by ligands that couple to 
GPCRs. For instance, EGF, which normally activates the MAPK cascade via the 
EGF receptor can stimulate adenylate cyclase activity by activating Gas. 

5 There are dozens of members of the G Protein-Coupled Receptor family that 

have emerged as prominent drug targets in the last decade. One non-limiting list of 
potential GPCR-siRNA targets is as follows: 

CMKLRl 

10 CMLl/ CMKLRl (Accession No. Q99788) is a member of the chemokine 

receptor family of GPCRs that may play a role in a number of diseases including 
those involved in inflammation and immunological responses {e.g. asthma, arthritis). 
For this reason, siRNA directed against this protein may prove to be important 
therapeutic reagents. 

15 

Studies of juvenile-onset neuronal ceroid lipofuscinosis (JNCL, Batten 
disease), the most common form of childhood encephalopathy that is characterized by 
progressive neural degeneration, show that it is brought on by mutations in a novel 
lysosomal membrane protein (CLN3). In addition to being implicated in JNCL, 

20 CLN3 (GPCR-like protein, Accession No. A57219) expression studies have shown 
that the CLN3 mRNA and protein are highly over-expressed in a number of cancers 
{e.g. glioblastomas, neuroblastomas, as well as cancers of the prostate, ovaries, breast, 
-V :«r--.ii^*- and colon) suggesting a possible contribution of this gene to tumor growth. For this ^ ^ 
reason, siRNA directed against this protein may prove to be important therapeutic 

25 reagents. 

CLACR 

The calcitonin receptor (CTR/ CALCR, Accession No. NM__001742) belongs 
to "family B" of GPCRs which typically recognized regulatory peptides such as 
30 parathyroid hormone, secretin, glucagons and vasoactive intestinal polypeptide. 

Although the CT receptor typically binds to calcitonin (CT), a 32 amino acid peptide 
hormone produced primarily by the thyroid, association of the receptor with RAMP 
(Receptor Activity Modulating Protein) enables it to readily bind other members of 
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the calcitonin peptide family including amylin (AMY) and other CT gene-related 
peptides (e.g, aCGRP and PCGRP). While the primary function of the calcitonin 
receptor pertains to regulating osteoclast mediated bone resorption and enhanced Ca^^ 
excretion by the kidney, recent studies have shown that CT and CTRs may play an 
5 important role in a variety of processes as wide ranging as embryonic/foetal 

development and sperm function/physiology. In addition, studies have shown that 
patients with particular CTR genotypes may be at higher risk to lose bone mass and 
that this GPCR may contribute to the formation of calcium oxalate urinary stones. For 
this reason, siRNA directed against CTR may be useful as therapeutic reagents. 

10 

OXTR 

The human oxytocin receptor (OTR, OXTR) is a 389 amino acid polypeptide 
that exhibits the seven transmembrane domain structure and belongs to the Class-I 
(rhodopsin-type) family of G-protein coupled receptors. OTR is expressed in a wide 

15 variety of tissues throughout development and mediates physiological changes 

through G(q) proteins and phospholipase C-beta. Studies on the functions of oxytocin 
and the oxytocin receptor have revealed a broad list of duties. OT and OTR play a 
role in a host of sexual, matemal and social behaviors that include egg-laying, birth, 
milk-letdown, feeding, grooming, memory and learning. In addition, it has been 

20 hypothesized that abnormalities in the functionality of oxytocin-OTR receptor-ligand 
system can lead to a host of irregularities including compulsive behavior, eating 
disorders (such as anorexia), depression, and various forms of neurodegenerative 
diseases. For these reasons, siRNA directed against this gene (NM_000916) may play tteHix-^,. 
an important role in combating OTR-associated illnesses. 

25 

EDG GPCRs 

Lysophosphatidic acid and other lipid-based hormones/growth factors induce 
their effects by activating signaling pathways through the G-protein coupled receptors 
(GPCRs) and have been observed to play important roles in a number of human 
30 diseases including cancer, asthma, and vascular pathologies. For instance, during 
studies of immunoglobulin A nephropathy (IgAN), researchers have observed an 
enhanced expression of EDG5 (NP_004221) suggesting a contribution of this gene 
product in the development of IgAN. For that reasons, siRNA directed against Edg5 
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(NM_004230), Edg4 (NM_004720), Edg7 (Nm_012152) and related genes may play 
an important role in combating human disease. 

Genes Involved in Cholesterol Signaling and Biosynthesis 
5 Studies on model genetic organisms such as Drosophila and C elegans have 

led to the identification of a plethora of genes that are essential for early development. 
Mutational analysis and ectopic expression studies have allowed many of these genes 
to be grouped into discreet signal transduction pathways and have shown that these 
elements play critical roles in pattem formation and cell differentiation. Disruption of 
10 one or more of these genes during early stages of development frequently leads to 
birth defects whereas as alteration of gene function at later stages in life can result in 
tumorigenesis. 

One critical set of interactions known to exist in both invertebrates and 

15 vertebrates is the Sonic Hedgehog-Patched-Gli pathway. Originally documented as a 
Drosophila segmentation mutant, several labs have recently identified human and 
mouse orthologs of many of the pathways members and have successfully related 
disruptions in these genes to known diseases. Pathway activation is initiated with the 
secretion of Sonic hedgehog. There are three closely related menibers of the Shh 

20 family (Sonic hedgehog, Desert, and Indian) with Shh being the most widely 

expressed form of the group. The Shh gene product is secreted as a small pro-signal 
molecule. To successfully initiate its developmental role, Shh is first cleaved, 
--whereupon the N-terminal tmncated fragment is covalently modified with cholesterol. 
The addition of the sterol moiety promotes the interaction between Shh and its 

25 cognate membrane bound receptor. Patched (Ptch). There are at lesast two isoforms of 
the Patched gene, Ptchl and Ptch2. Both isoforms contain a sterol-sensing domain 
(SSD); a roughly 180 amino acid cluster that is found in at least seven different 
classes of molecules including those involved in cholesterol biosynthesis, vesicular 
traffic, signal transduction, cholesterol transport, and sterol homeostasis. In the 

30 absence of Shh, the Patched protein is a negative regulator of the pathway. In contreist, 
binding of Shh-cholesterol to the Patched receptor releases the negative inhibition 
which that molecule enforces on a G-protein coupled receptor known as Smoothened. 
Subsequent activation of Smoothened (directly or indirectly) leads to the triggering of 
a trio of transcription factors that belong to the Gli family. All three factors are 



101 



relatively large, contain a characteristic C2-H2 zinc-finger pentamer, and recognize 
one of two consensus sequences (SEQ. ID NO. 0463 GACCACCCA or SEQ. ID NO. 
0464 GAACCACCCA). In the absence of Shh, Gli proteins are cleaved by the 
proteosome and the C-terminally truncated fragment translocates to the nucleus and 
5 acts as a dominant transcription repressor. In the presence of Shh-cholesterol, Gli 
repressor formation is inhibited and full-length Gli ftmctions as a transcriptional 
activator. 

Shh and other members of the Shh-PTCH-Gli pathway are expressed in a 
10 broad range of tissues (e,g, the notochord, the floorplate of the neural tube, the brain, 
and the gut) at early stages in development. Not surprisingly, mutations that lead to 
altered protein expression or function have been shown to induce developmental 
abnormalities. Defects in the human Shh gene have been shown to cause 
holoprosencephaly, a midline defect that manifests itself as cleft lip or palate, CNS 
15 septation, and a wide range of other phenotypes. Interestingly, defects in cholesterol 
biosynthesis generate similar Shh-like disorders (e,g, Smith-Lemli-Opitz syndrome) 
suggesting that cholesterol modification of the Shh gene product is crucial for 
pathway function. Both the Patched and Smoothened genes have also been shown to 
be clinically relevant with Smoothened now being recognized as an oncogene that, 
20 like PTCH-1 and PTCH-2, is believed to be the causative.,agent of several forms of 
adult tumors. For these reasons, siRNA directed against Smoothened (SMO, 
NM_005631), Patched (PTCH, nm_000264), and additional genes that participate in 
choleste^oUsignaling, biosynthesis, and degradation, have potentially useful research . - 
and therapeutic applications. 

25 

Targeted Pathways . 

In addition to targeting siRNA against one or more members of a family of 
proteins, siRNA can be directed against members of a pathway. Thus, for instance, 
siRNA can be directed against members of a signal transduction pathway {e.g. the 
30 insulin pathway, including AKTl-3, CBL, CBLB, EIF4EBP1, FOXOl A, F0X03A, 
FRAPl, GSK3A, GSK3B, IGFl, IGFIR, INPP5D, INSR, IRSl, MLLT7, PDPKl, 
PIK3CA, PIK3CB, PIK3R1, PIK3R2, PPP2R2B, PTEN, RPS6, RPS6KA1, 
RPX6KA3, SGK, TSCl, TSC2, AND XPOl), an apoptotic pathway (CASP3,6,7,8,9, 
DSHl/2, PI 10, P85, PDKl/2, CATENIN, HSP90, CDC37, P23, BAD, BCLXL, 
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BCL2, SMAC, and others), pathways, involved in DNA damage, cell cycle, and other 
physiological (p53,MDM2, CHKl/2, BRCAl/2, ATM, ATR, P15INK4, P27, P21, 
SKP2, CDC25C/A, 14-3-3, PLK, RB, CDK4, GLUT4, Inos, Mtor, FKBP, PPAR, 
RXR, ER). Similarly, genes involved in immune system function including TNFRl, 
5 IL-IR, IRAKl/2, TRAF2, TRAF6, TRADD, FADD, IKKe, IKKy, IKKp, IKKa, 

IkBa, IkBp, p50, p65, Rac, RhoA, Cdc42, ROCK, Pak 1/2/3/4/5/6, cIAP, HDACl/2, 
CBP, p-TrCP, Rip2/4, and others are also important targets for the siRNAs described 
in this document and may be useful in treating immune system disorders. Genes 
involved in apoptosis, such as Dshl/2,PTEN, PI 10 (pan), P85, PDKl/2, Aktl, Akt2, 

10 Akt (pan), pTO^^*^, GSK3p, PP2A (cat), P-catenin, HSP90, Cdc37/p50, P23, Bad, 
BclxL, Bcl2, Smac/Diablo, and Askl are potentially useful in the treatment of 
diseases that involve defects in programmed cell death (e.g. cancer), while siRNA 
agents directed against p53, MDM2, Chkl/2, BRCAl/2, ATM, ATR, pl5*^^^, P27, 
P21, Skp2, Cdc25C/A, 14-3-3a/£, PLK, Rb, Cdk4, Glut4, iNOS, mTOR, FKBP, 

1 5 PPARy, RXRa, ERa and related genes may play a critical role in combating diseases 
associated with disruptions in DNA repair, and cell cycle abnormalities. 

Tables VI -Table X below provide examples of useful pools for inhibiting 
different genes in the human insulin pathway and tyrosine kinase pathways, proteins 

20 involved in the cell cycle, the production of nuclear receptors^and other genes. These 
particular pools are particularly useful in humans, but would be useful in any species 
that generates an appropriately homologous mRNA. Further, within each of the listed 
pools any one "sequence maybe used independently but preferably at least two of the 
listed sequences, more preferably at least three, and most preferably all of the listed 

25 sequences for a given gene is present. 



Table VI 



Gene 
Name 


Acc# 


Gl 


L.L. 


Duplex # 


Sequence 


SEQ. 
ID NO. 


AKT1 


NM 005163 


4885060 


207 


D-003000-05 


GACAAGGACGGGCACATTA 


465 


AKT1 


NM 005163 


4885060 


207 


D-003000-06 


GGAC/\AGGACGGGCACATT 


466 


AKT1 


NM 005163 


4885060 


207 


D-003000-07 


GCTACTTCCTCCTC/\AGAA 


467 


AKT1 


NM 005163 


4885060 


207 


D-003000-08 


GACCGCCTCTGCTTTGTCA 


468 


AKT2 














AKT2 


NM 001626 


6715585 


208 
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GTAGTTGGATGATGAATTT 


469 


AKT2 


NM 001626 


6715585 


208 


D-003001-06 


GGAAAGAGGGGATGAGTGA 


470 


AKT2 


NM 001626 


6715585 


208 


D-003001-07 


GGGCTAAAGTGACCATGAA 


471 
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Klr\oK 1 


Kl^/I i Qi CA>I 

NM it>10U4 


00/1CCOC4 


COAC 

b29b 


AAor\o/^ e\c 

D-003020-05 


GGAAATATGGCTTCTCTGA 


545 


DIl/'QDi 


Kl^>l 'I 04 CA>I 

N M^l o 1 0-OfL^. 


OZ455251 


cone 

5295 


D-003020-Od 


AAA A /^/^ A A A /^/^ A A TT A 

GAAAGACGAGAGACCAATA 


546 


Klr\oK 1 


Klhil i Qi CAvl 

NM 1o1dU4 


oo A CCOC 4 


5295 


U-003020-07 


^^^r AAA A "1 1 ^^^r/^^r/"^ a a a 

GT AAAG C ATTGTGTC AT AA 


547 


DII/'QPi 

r IrwoKI 


KIK/I i Qi CA>I 

NM 1oioU4 


oo >1 C COC -1 

o^4bbzb1 


COAC 

5295 


U-OO0O2O-O0 


/^/^ AT/"^ A A /"^ T T /"^T/*^ AAA O A A 

GG ATCAAGTTGTCAAAG AA . 


548 


PIL^QPO 

Klr\oK^ 
















Klfcyi AACAOT 

NM UUOUZ/ 


yi QOCAA7 

4ozoyu/ 


COAA 

529b 


r\ AAOAOH AC 
D-OO0O2I-O5 


/"^/^ A A A /^y^/^/"^0/^ A A/"^AATA A 

GGAAAGGCGGGAACAATAA 


C >l o 

549 


piixopo 


M^/l AACAOT 
NM UUDU^f 


/IQOCQA7 

4o^oyu f 


COAA 

b29b 


r> AAOAOi AA 

U-00o021-0b 


GA FGAAGGGTACTGCAATT 


550 


PIl^QPO 

nlrxorxZ 


KIK^ AAt^A07 

NM UUOU^/ 


A QOt5QA7 


COAA 

b29b 


AAOAOi A7 

U-OU0U2I -07 


GGAGAGCGAATOTCACTAC 


CC 4 

551 


piL/'opo 


MK>i AAt^AOT 
NM UUDU^/ 


>1 QOftQA7 
4o^DyU t 


COAA 

o2yb 


|-\ AAOAO-I AQ 

U-UU0U21 -uo 


GCAAGA 1 CCGAGAOOAGTA 


ceo 

552 


pppopoD 














PPPOPQR 


MK^ AA/icyfj 
NM UU40/D 


4f ooyoo 


CCOi 

o021 


AAOAOO AC 

U-UUoU22-Ub 


A AX/^/^A/^/^l TA/^X I 1 O 1 '1 

GAA 1 GGAGU 1 1 AG 1 1 1 G 1 1 


ceo 

553 


PPPOPOR 


M^4 AA>1*^7<5 
NM UU40fO 


/1 7CfiQCO 

4 ooyoo 


CCOi 

o021 


AAOAOO AA 

U-UUoU22-0b 


GAGGGAAGGTGACATTATG 


C C >l 

554 


PPPOPOR 


NM UU40/D 


4f ooyoo 


CCOi 

0021 


AAOAOO A7 
U-UU0O22-O / 


X/^/^ AXX A O/^X/^ A A A OXXT 

1 GGA 1 r AGG I GAAGAGTTT 


555 


PPPOPOR 


NM UU40fD 


/l7CfiOC'3 

4/ ooyoo 


CCOi 

0021 


n AAOAOO AQ 
U-UU0U22-O0 


/^/^X/^ A A/^ A/^TTXX AO A A AX A 

GG 1 GAAGAG \ 1 lAGAAATA 


CCA 

556 


PTEN 














PTEN 


NM 000314 


4506248 


5728 


D-003023-05 


GTGAAGATCTTGACCAATG 


557 


PTEN 


NM 000314 


4506248 


5728 


D-003023-06 


GATCAGCATACACAAATTA 


558 


PTEN 


NM 000314 


4506248 


5728 


D-003023-07 


GGCGCTATGTGTATTATTA 


559 


PTEN 


NM 000314 


4506248 


5728 


D-003023-08 


GTATAGAGCGTGCAGATAA 


560 


RPS6 















105 





INIVI UUiUlU 


1 / 1 OoU4o 


ftl CIA 

01 y4 


pv AA'iAOyf AC 


/^/^/^ A/^ AAA P*TOA 1 1 A A/^X 


bbi 


r\"OD 


INIVI UU1U1U 


1/1 OoU4o 


Di y4 


pv AAQAOyl Aft 

U-UUoUz4-Ub 


r^/^ AX AXX/^/^X/^/^ A/^X/^ A^X 

VjoA I a I I LrO 1 oVjAO I oAU 1 


CftO 

bb^ 




INM UUiUlU 


1/1 00U4O 


D 1 y4 


r\ AA'iAO/l A7 


/^/^AA/^/^A/^A Af^T/^r^ A^ A A A 

OUAAvjIjAoAAO I ooAljAAA 


CftQ 
ODO 




Kl^/l nrnn-in 


1/1 OoU40 


Di y4 


r\ AAQAO/I AQ 


Or^X AX/^/^/^/^ A /^ A A A /^XX 

oUVj I A 1 vjQjUOAUALiAAo 1 1 


Ob4 


r\r^oOr\/-\ 1 
















INIVI UUZ9OU 


ZUl 4y04D 


DI yo 


pv AA'JAOC AC 

u-uuoUzb-Ub 


A T/^ A /^ A /^/**XX/^X A /"*TXX/^ 

vjA I oAOAOO 1 1 U 1 AU 1 II 0 


CftC 

bbb 




INm UUZb^Do 


ZU14yo40 


Diyo 


pv AAQAOC Aft 

u-uuouzo-Ub 


A/^ A AX/^/^/^/^Xi^/^X/^ AX/^ A 

VjAoAA I vjCjQjO I I UA 1 0 A 


CCft 

bbb 




INM uuzyoo 


ZU14yo4D 


01 yo 


AAOAOC A7 

U-UUou2b-U/ 


/"» A A at/^/^xx/^ a/^ a a 
UAACjUoCjjCjjA I 00 1 1 LfACjAA 


Ob/ 




iNM uuzyoo 


ZUl 4yo4D 


Diyo 


P\ AAOAOC AQ 


OOAOCGGCCTCjATGGAAGA 


coo 
ODO 


















INM UU400D 


4/ oyu4y 


biy / 


P* AAQAOft AC 

u~uuoUzb-ub 


GAAGGGAAGTTGTATCTTA 


con 

oo9 


PPQOU' AO 


INM UU400D 


4 / oy U4y 


biy / 


P^ AAOAOft Aft 

U-UUo02b-Ub 


AAA /^X AT/^T/^T AT/^TT A /^T 

GAAAGTATGTGTATGTAGT 


570 


DPQCl^' AO 


INM UU4DoO 


4 / oy U4y 


biy/ 


P\ riAOAOft A "7 


A A/^/^ AT/^^ AAA ATT A 

GGACAGCATCCAAACATTA 


571 


PPQRk' A'^ 


INM UU4000 


4/oyu4y 


biy/ 


P* AAOAOft AQ 

U-UUoU2b-Uo 


r^r^ A/^/^T/^ A ATT/^^T/^/^ ATA 

GGAGGTGAATTGCTGGATA 


572 


oor\ 
















INM KJUOOZf 


ouozuyu 


b44b 


pv r4r\0A07 Ad 
U-UUoU2/-Ul 


XX A ATr^r^~rr^r^ a/^ aoxt/^xx 
1 1 AATGGTGGAGAGTTGTT 


573 


OOl\ 


INM UUoOZf 


co'iOAQn 

ouozuyu 


D44d 


pv r»noA07 A>i 
U-UUoU2/ -U4 


A T — r A A r^~r/^ a t/^ a t/^t/^ a 
A 1 1 AAU 1 GGGA 1 GA 1 LrlOA 


574 


ool\ 


INM UUobZ/ 




dA Ad 

b44b 




/"* A A AAA A A T/^/^Ty^ AAA 

G AAG AAAG C AATCCTG AAA 


575 




INM UuOO^f 


zbi bozbz 


D44b 


D-003027-Od 


AAA ^ A A /^/^T/^ AAA T^>T A ^ 

AAACACAGCTGAAATGTAC 


576 
















1 


INM UuUoOo 


Zhh I bb^b 


/^4o 




A A AT/**/**/^T ATT/^T/^T/^T 

G AAG ATG GCT ATTCTGTGT 


577 


1 1 


INM UUUoDO 


Z44/ obZb 


-7O/I Q 

/ Z4o 


L)-UU*5U2o~0b 


T A T/~* A A /^/^T/^/^ A A /^TT A 

TATGAAGGCTCGAGAGTTA 


f— -7 o 

578 


1 oOl 


NM UUUoOO 


z44 / bbzb 


/24o 


n\ rvrvo/^00 rf^"7 

D-00302o-07 


CGACACGGCTGATAACTGA 


579 


1 oOl 


NM UUUoOo 


Z44 / bb^b 


*70>l Q 

/z4o 


U-00302o-0o 


CGGCTGATGTTGTTAAATA 


580 


1 oOZ 
















K.i^>l nr\f\cz.AQ 
INM UUUo4o 


1 UyooUUb 


/ z4y 


n\ AAorvorv rvc 


A 1 1 A A T/^T/^T — I" A /** A T A 

GCATTAATCTCTTACCATA 


581 


1 oOz 


NM UUUb4o 


1 oyooOOD 


7249 


D-003029-06 


/-x/-^ A AT/^T/^^T^ 1 1 f^~n 1 1 ■ 

CCAATGTCCTCTTGTCTTT 


582 




NM UUU04o 


1 OyooOOb 


7249 


D-003029-07 


A A A A T/^ A f~^~r A ^ I 1 

GGAGACACATCACCTACTT 


583 


TSC2 


KIM ono^dR 




794Q 


L^-uuouzy-uo 


Tr^Ar^PAf^rioxoAXOA Ar:: AA 


Oo4 


XP01 














XP01 


NM 003400 


8051634 


7514 


D-003030-05 


GAAAGTCTCTGTCAAAATA 


585 


XP01 


NM 003400 


8051634 


7514 


D-003030-06 


GCAATAGGCTCCATTAGTG 


586 


XP01 


NM 003400 


8051634 


7514 


D-003030-07 


GGAACATGATCAACTTATA 


587 


XP01 


NM 003400 


8051634 


7514 


D-003030-08 


GGATACAGATTCCATAAAT 


588 



Table VII 



Gene 
Name 


Acc# 


Gl 


L.L 


Duplex # 


Sequence 


SEQ. 
ID NO. 


ABL1 














ABL1 


NM 007313 


6382057 


25 


D-0031 00-05 


GGAAATCAGTGACATAGTG 


589 


ABL1 


NM 007313 


6382057 


25 


D-0031 00-06 


GGTCCACACTGCAATGTTT 


590 


ABL1 


NM 007313 


6382057 


25 


D-0031 00-07 


GAAGGAAATCAGTGACATA 


591 


ABL1 


NM 007313 


6382057 


25 


D-0031 00-08 


TCACTGAGTTCATGACCTA 


592 


ABL2 














ABL2 


NM 007314 


6382061 


27 


D-0031 01 -05 


GAAATGGAGCGAACAGATA 


593 


ABL2 


NM 007314 


6382061 


27 


D-0031 01 -06 


GAGCCAAATTTCCTATTAA 


594 


ABL2 


NM 007314 


6382061 


27 


D-0031 01 -07 


GTAATAAGCCTACAGTCTA 


595 


ABL2 


NM 007314 


6382061 


27 


D-0031 01 -08 


GGAGTGAAGTTCGCTCTAA 


596 


ACK1 














ACK1 


NM 005781 


8922074 


10188 


D-0031 02-05 


AAACGCAAGTCGTGGATGA 


597 


ACK1 


NM_005781 


8922074 


10188 


D-0031 02-06 


GCAAGTCGTGGATGAGTAA 


598 


ACK1 


NM 005781 


8922074 


10188 


D-0031 02-07 


GAGCACTACCTCAGAATGA 


599 



106 



MV^rv 1 


INIVI UUDfOI 




1 ni Qft 
1 U1 oo 


u-uuon UZ-Uo 


1 OAoLfAV3V^ML/L/L>Al^ 1 A 1 1 A 


RAO 
OUU 


Al U 
MLix 
















INIVI UU^oU*f 


zyuzyoo i 


zoo 


r\ 00110*1 o*^ 
U-UUO 1 uo-uo 


r^Ar^ A ahl ATr^r^Tr^r^ An A at a 

oAv^AAoA 1 1 oOAoAA 1 A 


R01 
OUl 


Al VC 


INIVI UU*foU^ 


zyUicyoo 1 


zoo 


p\ 00010Q OR 

U-UUO 1 uo-uo 


ooAAoAo 1 O 1 ooOAvj 1 1 oA 


R09 
OUZ 


Al W 


KIM 004*^04 
INIVI uu**ouh- 


9Qn9QR'^1 

zyu^yoo 1 


zoo 


p\ 00*^10*^ 07 
U-UUO 1 UO-U ( 


nn An nTnnnTnnnn An ATT 


OUO 


Al i( 


INIVI UU'fOU'f 


Z9UZv700 1 


zoo 


n nono'^ Oft 

U-UUO 1 UO-Uo 


nAAnTnnAnTnAAnnAAnA 

oAAVur 1 oOAo 1 oAAooAALrA 


OU4 


AYI 














AYI 


MM 09101*^ 

INIVI \j£, 1 9 1 O 


Z 1 OOO'l-DO 


OOo 


pi noQi o/i o<^ 

U-UUO 1 U4-UO 


n n Tn An An nTn n Ann a ttt 
Vjo 1 OAoAoO 1 ovsAooA 1 1 1 


ROC 
OUO 


AYI 




91 CI'^R/IR'^ 
Z 1 Ov3D4DO 


OOo 


rv OOQ1 O/I OR 

U-UUO 1 U4-UD 


nAAAnAAnnAn AnnnnTT a 
oAAAoAAoIjAvjAOL/oo 1 1 A 


ROR 
OUO 


AYI 
MAL 


INIVI uz 1 y 1 o 


91 ci'^R/lAc: 
Z 1 DOO^DO 


OOO 


r\ noQio/i 07 
u-uuon U4-U / 


nn A An A An A~r/^~r Ar^ a at nn 
OV^AAoAAVjA 1 U 1 Ai^AA 1 oo 


C07 
OU f 


AYI 
MAL 


KIM nOiOi'i 
iNlVi \}£.\\i\0 


ZlOOD*l'DO 


OOo 


U-UUol U4-UO 


nn A AnTnn ATnnTn a a t/^ a 
oijAAO 1 (jOA 1 (jU 1 oAAT(jA 


CAQ 

bUo 


Rl VC 














Rl 

DLI\ 


iNlVI UU1 f ID 


hDUZ*!-! Z 


D4U 


A004 0C AC 

U-UUol UO-Uo 


n a nn ATnnnTnnTnn a ttt 


CAA 

buy 


Rl IC 


INIVI UU 1 /ID 


'tOUZ'f 1 Z 


04U 


p\ AA01AC AC 

U-UUol Uo-Ub 


An ATn A AnnTnnnn a tt a a 
AUA 1 (jAAoCj 1 vjCjUUA 1 1 AA 


CH A 

blU 


Rl W 


MM nniTi^^ 

INIVI UU 1 f 1 O 


*fOUZH 1 Z 


04U 


p\ OOQ1 OC A7 

U-UUO 1 UO-U/ 


nnTn Annnnnn AAnAnAAn 
Kd\:j 1 OAoOoOLrOAAvjALrAAo 


CI 1 

bl 1 


Rl k' 


MM nOITiR 
INIVI UU 1 r 1 0 


HOUZ** 1 Z 


D4U 


U-UUO 1 UO-Uo 


A A AnTnnnnTnTnn An a a 
vjAAAO 1 Oooo 1 O 1 IjoAUAA 


biz 


RMY 














rK/iy 

DIVIA 


Kl^A nniTOi 

INM UU 1 f Z 1 


91 QfxQQII 

z 1 ooyool 


DDU 


r> 00'510R AC 

U-UUol Ub-Uo 


A A An A A AnnTTTnnT A nT A 
AAAUAAAL/U 1 1 1 1 AO 1 A 


bio 


tDlVIA 


NM UUl ^1 


zn ooyoo 1 


boU 


p \ AAO-1A£5 AC5 

U-UUol Ub-Ub 


AACjCjACjUA 1 1 1 A 1 Cjo 1 1 A 


614 


R^^Y 


M^^ nni T9i 

INIVI UU 1 / Z 1 


91 '^t^QQ'il 

zn ooyooi 


OuU 


PI AA11AR A7 

U-UUol UO-U/ 


n An A An An a i i Ann t i n i \ 
oAoAAvj AvjA i 1 AUO Moll 


bl O 


c5MA 


INM UUl f ^1 


z 1 ooyooi 


DOU 


r\ AAO'1A£5 AQ 

U-UUol Ub-Uo 


nT A AnnnT^T^ a at/^ at a a 
(j T AACjGCTGTGAATG ATAA 


616 


RTW 
D 1 W 














RTk 
D 1 l\ 


INM UUUUOl 


400/ O/D 


byo 


pv AAO'1A7 AC 

U-UUol U/-UO 


^ a a ^ A f^/^ A AT/-*/^ A A /^/^ 1 1 A 

GAAOAGGAA 1 GGAAoL/ 1 1 A 


617 


D i l\ 


INM UUUUDi 


'I'DOf Of O 


oyo 


P\ AA01A7 AC 

U-UUo1U/-Ub 


nnT ATnnnnTnnn A a attt 
1 A 1 oGGO 1 oOOAAA 1 1 1 


C'l Q 

61 o 


RTk 
D 1 rV 


K|K4 nnnn^i 
INM UUUUDl 


400/ O/ o 


ROC 

oyo 


r\ AAQI A7 A7 

U-UUO 1 Uf-Uf 


/^AAA/^OAA /"^TT Ann ATnnT 
oAAAGUAAU 1 1 AUL/A 1 GG 1 


C-l A 

biy 


RTk 
D 1 1\ 


INM UUUUDI 


400f O/D 


oyo 


r\ AAQ'1A7 AQ 

U-UUol Uf-Uo 


o/^TA A Ann ATn A Ann An i i 
GG 1 AAAUGA i GAAGGAG 1 1 


COA 

6zU 


OjiUOrTD4 
















NM UOOOOU 


lyyzoDoo 


1 1 Zoo 


p^ rkA04AQ AC 

U-UUol Uo-Uo 


A A ^11 A i^/^r^ A A A A A 1 1 

OAAG 1 1 AGGGAAGAOAA T T 


621 




M^/l ri'j'iCKn 
INM UooooU 


lyyzoooo 


I 1 OQC 

I I Zoo 


p\ AAQ-IAQ AC 

U-UUo1 Uo-Ub 


o A A A \ \ n A Ar^r^r^Tr^ a/^t/^ a 
GAAA 1 1 GAAGGO 1 OAG 1 GA 


coo 

6zz 


VyZUOrTD*f 


INM UooDOU 


1 yyzoDoo 


I 1 OQC 

I I ZOO 


P» AAOIAQ A7 

U-UUol Uo-U/ 


Tnn A An AnnTf^ a a att/^t 
1 GGAAGAGO 1 GAAGA M G 1 


coo 

bzo 




NM UooooU 


ly^zoooo 


11 ZOO 


r\ AAO-IAQ AQ 

U-UUol Uo-Uo 


r^~\ — r/^/^A A/^T/^/^l 1 ATATA 

GG M GGAAGTG GIT ATATA 


CO >l 

624 


















NM UUOZl 1 


Z/ ZOZOOO 


1 /I 

140O 


p\ AAO-IAA AC 

U-UUol uy-uo 


nn An An nTr^T/^ a i r i a 
GGAGAGG 1 G 1 GAGG 1 1 1 GA 


coc 

6zo 


Lror 1 K 


K|Ki| nnco'i i 
NM UUozl 1 


z Zozooo 


14oo 


p* nAO-IAA AC 

U-UUol Uy-Ub 


A A A A r^r^ r^-r a f^f^ i i r^f^ a a 
GAAGAAGGG 1 AGG 1 1 GGAA 


coo 

626 




NM UUDZH 


ZrZOZOOO 


1 ylOA 
140D 


AAHAA A7 

U-UUol uy-u / 


nnAnf^r^Ar^f^~r^r^f^~\ — r a a a 
GGAGGGAGG 1 GGG 1 1 AGAA 


COT 

bz7 


L/or 1 K 


NM uuozn 


z f Zozooo 


A A OA 

14oD 


r\ AAH AA AQ 

U-UUol uy-uo 


n^ a a ^ A a /^/^T/-*/^ a III r^r^ 
GGAAGAAGGTGGAGTTTGG 


coo 

628 


LrOl\ 
















NM_UU4ooo - 


A 7CQri77 


A A Ad 

1445 


U-UUollU-Uo 


A A A T/'* 1 A f 1 A A /^T A 

C AG AATGT ATTG CC AAGT A 


629 




K|hi| nnyiQQQ 
NM UU4oc5o 


~A 7CQA77 


A. A AC 

144o 


r\ AAOHHA AC 

U-UUol lU-Ub 


GAAGAAAG 1 GGGGG 1 GAAG 


630 




NM UU4000 


/I 7CQn77 

4/OOU/ 1 


A A AC 

1440 


pv f\00-1-1A A "7 

U-UUol 1U-U/ 


GGGAG 1 GGG 1 1 A 1 GGAAGA 


631 


L/OrV 


NM UU'frOoO 


yi7CQA77 

4rOOU/ f 


1440 


r\ AAQ'I^IA AQ 

U-UUoHU-Uo 


nn An A Ann nnT An a a n ATn 
GGAGAAGGGGTAGAAGATG 


coo 

632 


nnRi 
















Kl^4 ni IQQyl 

NM ui oyy^ 


7CiCtC\AQ.A 

f ODy4o4 


7Qn 
f OU 


p^ AAO'I 1 *! AC 

U-UUol 1 1-UO 


nn An at nn An i i vn a n i "i t 
GGAGA 1 GGAG 1 1 I GAG 1 1 \ 


coo 

633 




Nivi u 1 oyy^ 


/ OOy 4o4 


7Qn 
f OU 


PI AA'51 1 1 AC 

U-UUol 1 1-ub 


n An AnnnnnTnTn ATn i i i 
GAGAGGGGG 1 G 1 GA 1 G 1 1 i 


CO y1 

634 


LfLriA 1 


NM u 1 oyy^ 


1 ooy4o4 


7Qn 
/ ou 


Pi AAQI 1 1 A7 
U-UUol 1 1-U/ 


nnT nnT A nnT nTn a An a ~rr^ 
GG 1 GG ! AGG 1 G 1 GAAGA 1 G 


coc 

boo 


nnRi 


MM ni '^QQA 

NM u i oyyH- 


f DOy 4o4 


/ OU 


r^_AAQ1 1 1 AQ 

U-UUO 1 1 1-Uo 


Tn A A An AnnTn a An ATn at 
1 GAAAGAGG 1 GAAGA 1 GA 1 


coc 

bob 


DDR2 














DDR2 


NM 006182 


545381 3 


4921 


D-0031 1 2-05 

La/ \J\J\f i 1 \J\J 


G GT AAG AACTAC AC A ATC A 


vIO / 


DDR2 


NM 006182 


5453813 


4921 


D-0031 12-06 


GAACGAGAGTGCCACCAAT 


638 


DDR2 


NM 006182 


5453813 


4921 


D-0031 12-07 


ACACCAATCTGAAGTTTAT 


639 


DDR2 


NM 006182 


5453813 


4921 


D-0031 12-08 


CAACAAGAATGCCAGGAAT 


640 


DKFZp761 
P1010 














DKFZp761 
P1010 


NM_018423 


8922178 


55359 


D-0031 13-05 


CCTAGAAGCTGCCATTAAA 


641 



107 



DKFZp761 
P1010 


NM_018423 


8922178 


55359 


D-003113-06 


GATTAGGCCTGGCTTATGA 


642 


DKFZp761 
P1010 


NM_0 18423 


8922178 


55359 


D-0031 13-07 


CCCAGTAGCTGCACACATA 


643 


DKF2d761 
P1010 


NM 018423 


8922178 


\J\J\J\JsJ 




00 1 00 1 r\\^\^ 1 OMMO 1 0 1 M 1 


RAA 


torK 














torK 


K\KA nnczooQ 


4ooo1 yo 


A ACC 

19ob 


r\ AAO-I 4 >l AC 

U-00on4-05 


^ A A AAA /^T^ A A TT/^ AAA 

GAAGGAAACTGAATTCAAA 


645 




Kwii nncooft 
InIvI UUOidZo 


A QQC'f AO 

4c>ooiyo 


1 9ob 


r\ AAO-l A A r\Ci 

U-OO01 14-Ob 


^ AAA T A T^^T A /^"T* A /^/^ AAA 

G G AAAT ATGT ACT ACG AAA 


646 


torK 


K|ivi| nncooQ 


A QQC'1 AQ 

4oooiyo 


A ACA 

1956 


r\ AAOH A A f\~T 

D-0031 1 4-07 


CCACAAAGCAGTGAA 1 1 1 A 


647 


torK 


KIKil AncOOQ 

NM UUo^^O 


A QQC-I AQ 

4ooOl yo 


4 ACC 

lyoo 


D-0031 1 4-08 


GTAACAAGCTCACGCAGTT 


648 
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U-UUO 1 Of -uo 


OAOAOA Af^r^r^r^ a ot a tt a a 
UAoAoAAUUUUAU 1 A 1 1 AA 


Q At\ 

o4y 


PTk'7 


MM nn9A9i 

INIVI UU^IOZ I 


97AftRR1 n 
£. ( 0000 1 U 


0 f Of 


pi AH'ii f57 f^^5 

U-UUOI 0 f -UO 


OOAOAOA Ar^ r^r^c*' a ot a i fa 
UoAoAoAAoUUUAU 1 A 1 1 A 


ooU 


PTk7 


INIVI UUZOiC 1 


Zf 00001 U 


Of Of 


pv AnQIfiT A7 

U-UUOI Of -Uf 


r^r^ Af^r^r^ a ottoo a o ato i r 
OoAoUUAo 1 1 ooAoA 1 U 1 F 


Q IT A 

851 


PTK7 


MM nn9ft9i 

INIVI UU^O^ 1 


97RARR1 n 

z / 0000 1 U 


Of Of 


pv nn'J'l R7 HQ 
U-UUOI 0 f -UO 


OAAO AO ATOOOOOTATTTO 

oAAoAUA 1 oUUoU 1 A 1 1 1 o 


OOZ 


PTKQ 














PTkQ 


MM 009^^99 
iNiVI UU^cO^^ 


4'=\nR97A 


0 f 00 


pv 00*^1 RQ nc; 
U-UUO 1 Do-UO 


OA AO A AOT AOO AO AO ATT A 
oAAoAAU 1 AUoAUAoA 1 1 A 


OOO 


PTkQ 


MM nn9ft99 
INIVI UU^O^^ 


*f OUOZ / H 


Of 00 


pv An'ii*?Q no 
U-UUOI 00-uy 


OAAOOAOA OT ATTT AOA OT 

oAAUoAUAU 1 A 1 1 1 AoAo ! 


oo4 


PTKQ 


MM nn9ft99 

INIVI UU^O^^ 


HOUOZ / *f 


Of 00 


n nn'^i ra 1 n 
U-UUO 1 00- 1 u 


riAC^f^C^r^ ATOOTOT ATTOT A 

oAUUooA 1 UU 1 U 1 A 1 1 U 1 A 


OOO 


PTKQ 

1 1 1x57 


MM nn9ft99 

INIVI UU^O^^ 


ARnR974 

H-OUOZ f *+ 


Of 00 


n nn'^i ra 1 1 

U-UUO 1 00-1 1 


OXriO Af^ A OTTOO 1 1 1 ATf^ A 
U 1 oUAoAU 1 1 UU 1 1 1 A 1 oA 


QCA 

c>oo 


PTkQI 














PTkQI 


MM n079A/l 
IN IVI UU / £,0** 


Ol OhOhhO 


A A 'i.AA 
1 1 Off 


pv AnO<1 CO AC 

U-UU01 oy-uo 


AOAOAOAO OTOO A O O A O A T 

AUAUAUAvjU 1 UUACjUACj AT 


857 


PTkQI 


MM nn79A4 
INIVI UU f Z.O't 


0 1 0*fO*f**0 


A A 'XAA 
1 A Off 


pv AA'ilRQ HR 

U-UUO 1 oy-uo 


TT A AOO A OOTO AAOAOAOA 

1 1 AAUoAoo 1 oAAoAUAUA 


oo8 


PTK9L 


NM 007284 


31 543446 


11344 


D-0031 69-07 






PTK9L 


NM 007284 


31543446 


11344 


D-0031 69-08 


GCTGGGATCAGGACTATGA 


860 


RET 














RET 


NM 000323 


21536316 


5979 


D-0031 70-05 


GCAAAGACCTGGAGAAGAT 


861 


RET 


NM 000323 


21536316 


5979 


D-0031 70-06 


GCACACGGCTGCATGAGAA 


862 


RET 


NM 000323 


21536316 


5979 


D-0031 70-07 


GAACTGGCCTGGAGAGAGT 


863 



112 



DPT 


Kl^yl nnn'joQ 
INIVI uVUoZo 


^1 OOOOl D 


CQ7A 

oy / y 


AAOH 7A AO 

U-UUol /U-Uo 


1 1 AAA 1 (joA 1 (jijUAA 1 1 oA 


ob4 


rsVJrx 1 














r\vJr\ 1 


M^/l nncn-io 


40ZD0D 


4yi y 


pv AAO'l 71 AC 

U-UUol / 1 -Uo 


(jUAALjOA 1 0 I 1 1 AO 1 AooA 


QAC 

obo 




INIVI UUOUI^ 


/lft0^5ftA7 
40ZDODf 


4yi y 


n\ AAO-I 7-1 AC5 

U-UUol /^l -Ub 


A/^r^ A A/^/^/^T a a A/^ a ^/^TA 

oACjOAAoVjO 1 AAAIjACjO i A 


obb 


ROR1 


INIVI UUDUIZ 


4oZDoO/ 


AQ^ Q 

4y 1 y 


r\ AAQ-I 7i n7 


Af^ Ar^/^ A a/^tt/"* at/^t a a a 
vjAoAoOAAO 1 1 OA 1 0 1 AAA 




rxwix 1 


INIVI UUDUliC 


40i£0o0 f 


4y ly 


Pi AAH71 AQ 

U-UUo 1 / J-Uo 


A/^ A ATOT/^/^T/^T/^T/^ A A A 

vj AoAA 1 vj 1 00 1 0 1 \j 1 OAAA 


Obo 


RORQ 














DODO 


NM UU40DU 


1 y / 40C>y I 


4y2U 


AA0"170 AC 

U-UUol f 2-Uo 


ooAAO IOCjOToOTCjOO I A 1 


oco 

869 




NM UU4t3DU 


1 y f 4oc5y A 


4y2U 


AA0'i70 rxa 
U-UUol /2-Ub 


CaOACjC^TCjOOTooTOAoATo 


O TO 

870 




INIVI UU40bU 


•X A7/1 QQQ7 

1 y f 4ooy / 


A OOA 

4y2U 


n\ AA0«<70 A7 

U-UUol 72-07 


GoAATGTGoTAGTGTAoGA 


OT-I 

871 


RORO 


INIVI UU400U 


A Q7A QQQ7 

1 y / 4ooy f 


4y2U 


r\ AA0'170 AQ 

U-UUo1 72-Uo 


A A A A ^ A AT AT/^/** 1 1 A 

G AAG AC AG AAT ATG G TTo A 


OTO 

872 


K\JO 1 














KUO 1 




•1 A AO yi 4 C >l 

iyy^4iD4 


bU9o 


D-OO01 73-05 


GAGGAGACCTToTTACTTA 


873 




NM UU2y44 


<4 AAO >1 £i /I 

iyyz4iD4 


CAAO 

bUyo 


U-UO01 7o-U6 


1 1 A A A t^r^ 1" T/^ A A T'l A 

TTAoAGAGGTToAGGATTA 


874 


K\Jo 1 


NM UUZy44 


1 yy^4i d4 


buytt 


AA0'i70 A7 

U-UUo1 iO'Uf 


A A AAA /^/"»T A A A T/^ A A 

G AAo AAAooT AAG o ATG AA 


O TC 

875 


R/^C-I 


NM UU^y44 


1 yy^4i b4 


£5AAQ 

buyo 


r\ AA0 4 70 00 
D-OO01 73-08 


AAA Kr^f> A y^'i ■ A A AHT A A 

GAAAGAGoACTToAAATAA 


876 


DVI/' 














RVI/" 
KYI\ 


NM uuzyoo 


1 1 ODOl OO 


£20CA 

b2oy 


r\ AAO'l7>1 oc 

U-UUol 74-Uo 


A A A A"ri^/"» 1 r A ^e^r^' a a t a 

GAAAGATGGTTAooGAATA 


O TT 

877 




NM uuzyoo 


1 lODO 1 OO 


flOCA 

b2oy 


U-UUol fH'UO 


OAAA /^"T A A 1 1 /^~T~/~* A A /^T — 1* 

oAAAGTAGATTCTGAAGTT 


OTO 

878 


RVk" 


NM uu^yoo 


1 IOOOIOO 


£iOCA 

b2oy 


r\ AAO'17>l A7 

U-UUo1 7^~\j7 


T oAoT AoG oToT ATooTTT 


OTO 

879 


RVl^ 
KYl\ 


NM uu^yoo 


1 1 oool 00 


COCA 

b2oy 


r\ OAO ^ ~7 A no 
U-UUol 74-Uo 


/^T/^ A A A TP A T A A A T A 

G GTG AAG G ATATAGo AATA 


880 


QRr* 














oKO 


NM 0Uo41 / 


2I00I2IU 


6714 


D-0031 75-05 


GAGAACCTGGTGTGCAAAG 


881 


ODO 

OKO 


NM UUo41 7 


21 0DI2IO 


C7'l A 

6714 


D-0031 75-09 


GAGAGAACCTGGTGTGCAA 


882 




NM UUo41 


21ob1z1U 


C7'l A 

6714 


AAO'l7C A r\ 

D-0031 75-10 


GGAG 1 1 1 GCTGGAC 1 1 1 CT 


883 




NM UUo41 / 


21oDl21U 


C7A A 

b714 


0004 7C A A 

D-0031 75-1 1 


AAA /^T/^ A A f^f^ A /^f^ A. A 

GAAAGTGAGACCAoGAAAG 


884 


oYr\ 














oYt\ 


NM UUol 77 


04 0£? H ceo 

21 obi oo2 


6OOO 


D-0031 76-05 


A A T' A A ~I~/^~T"/^ A A A A "T~ A 

G G AAT AATCTC AAG AATC A 


885 


oYr\ 


NM UUol / A 


04 o^SH ceo 

21ob1oo2 


600O • 


D-0031 76-06 


A A /— O /^"POT/^/^T A Ail 

GAACTGGGCTCTGGTAATT 


886 


o Yr\ 


NM UUol f I 


OH OCH ceo 

21 obi oo2 


COCA 

booU 


U-UUol 76-07 


A A A A A /^/^ AAA "1' "1 

GGAAGAATCTGAGCAAATT 


887 


o Yr\ 


NM UUo1 / 1 


OH QC-I ceo 

21ob10o2 


boOU 


U-UUol 76-Uo 


A A A A A ~r^^~¥~^*^ A A ^ A T" 

G AAo AG AC ATGTCAAG GAT 


888 


1 czO 














1 11 


NM UUo^iO 


A CZ.r\~7A OQ 

4oU/42o 


/UUb 


U-UUol 77-U5 


AAA 1 1 /^~T~ /~^~T~ A A A /^T/^ A 

GAAATTGTCTAGTAAGTGA 


889 


1 to 


NM UUoZiO 


4oU / 42o 


7006 


D-0031 77-06 


A /^/^T/^ A A ~1~ 111 A A 1 1 A 

CACCTGAAGTG 1 1 1 AATTA 


890 


1 to 


NM UUo210 


4o0742o 


7rtr\c 

7006 


P\ /\00<4 ~7~7 0 ~7 

D-0031 77-07 


/^~l~ A AAA ^^~f~/~*/^ A A ~r AAA 

GTACAAAGTCGCAATCAAA 


891 


1 to 


NM UUo^lO 


>ICA7>IOO 

4o0742o 


7AAO 

7006 


D-0031 77-08 


TGGAGGAGATTCTTATTAA 


892 


1 tf\ ^ 














\ tr\ 


NM UUU4oy 


A CC7QCiQ 

4oo/obo 


7A4 A 
f U1U 


p\ AAOH 70 r*c 

D-00317QrJD,5 


A A Ai^ A AT AT/^/^/^*¥*/^0 AAA 

GAAAGAATATGCCTCCAAA 


ooo 

893 


1 tl\ 


KIKil AAA>ICA 

NM UUU4oy 


>l CC7QCQ 

4oo7obo 


7 A 4 A 

7U1U 


D-0031 78-06 


/"* A A T/^ A A TO AAA 1 ' 1 1 /•> A 

G G AATG AC ATC AAATTTC A 


894 


1 tl\ 


Klhil AAA>1 CA 

NM UUU4oy 


A CC7QCQ 

4oo7obo 


7 A 4 A 
/U1U 


n\ rvoo A TO r\~7 

D-UU01 78-U7 


TO A A OT A OOT/^ A T A TT/^T A 

TGAAGTACCTGATATTCTA 


895 


1 tN 


KIK>I AAAyl CA 

NM UUU4oy 


VlCC7QfiQ 


7A4 A 
fUlU 


D-0031 78-08 


OO AAA O A OOT A OOTO AATA 

CGAAAGACCTACGTGAATA 


896 


TIC 
1 It 














TIP 
1 It 


NM UUo4^4 


4ooOboU 


7A7C 


r> AAO A TA OC 

U-UUo1 79-Uo 


O A O A OO A OO I'T'l ATOTO A A 

GAG AG GAGG 1 1 1 ATGTGAA 


897 


TIP 
1 It 


NM UU04Z4 


4ooOboU 


7A7C 


r\ AAOH7A AC 

u-uuoi / y-ub 


O O O A O A OOOTOT A OOOTT A 

GGGAOAGCCTCTACCCTTA 


ooo 

898 


TIP 
1 It 


NM UU04^4 


A QQCC50A 

4ooOboU 


7A7C 


n\ AAOH 7A rv7 

U-UUol /y-u7 


A A /^TTOT/^TO O AAA 1 1 /"» /"» 

G AAGTTCTGTG C AAATTG G 


899 


TIP 
1 It 


NM UUo4^4 


4ooOboU 


7A7C 
f U70 


AAO«1 7A AO 

U-UUol f 9-Uo 


A A O A T/-* OOTO A O A A OTO 

CAACATGGCCTCAGAACTG 


900 


1 iNr\ 1 














1 rvirs. 1 


NM uuoyoo 


A CA7C-i A 

4oU/^b1U 


or 1 1 


r\ AAOH OA AC 

U-UUol oU-Uo 


1 1 OT/^^^^^O/* 1 A A/**TOTA A 

GTTCTGGGCCTAAGTCTAA 


901 


1 INf\1 


NM UUoyoO 


ylCA7fi'1 A 

4oUf blU 


or 1 1 


r\ AAQ'IQA Afi 

U-UUo J oO-Ub 


O A A OTO O OTOT A O A A O A TO 

GAACTGGGTCTACAAGATC 


ooo 

902 




INIVI uuoyoo 


A CA7ft'1 A 
40Uf D 1 U 


0/ 1 1 ' 


r\ AAQIQA A7 
U-UUol OU-U / 


OO AO A OOT A TOO OTO A TO A 

OGAGAGG 1 A 1 OGG 1 OA 1 GA 


ooo 

903 


TNK1 






ft71 1 

0 f 1 1 




p p ATPpTr^ A p ATT A 

OO v-zM 1 V^\^ 1 OOMO 1 1 M 


yu4 


TXK 














TXK 


NM 003328 


4507742 


7294 


D-0031 81 -05 


GAACATCTATTGAGACAAG 


905 


TXK 


NM 003328 


4507742 


7294 


D-0031 81 -06 


TCAAGGCACTTTATGATTT 


906 


TXK 


NM 003328 


4507742 


7294 


D-0031 81 -07 


GGAGAGGAATGGCTATATT 


907 


TXK 


NM 003328 


4507742 


7294 


D-0031 81 -08 


GGATATATGTGAAGGAATG 


908 



113 



TVk'O 
1 T 














TVk'O 


rMM UU000I 


^OU / / ^0 


/ zy f 


Pi QO AC 

U-UUolo^-UO 


oACaCaACjAT UOALfOAU 1 1 1 A 




1 T T\£, 


INIVI UUoooi 


ARC\'7~7 ASK 


79Q7 

1 zy f 




r^r^ /^Tr^r^ a att/^o a/^ ax a a 
oUA 1 UOAUA 1 1 oUALfA I AA 


yiu 


\ T 


rMIVI uuooo 1 


Af^C\'77A9K 
*+OU ( ( 'f 0 


79Q7 

f zy / 


U-UUoT <5^-U/ 


T"/^ A A ATA/^/^XA/^i^/^ A/^ A/^T 

1 UAAA 1 AOU 1 Ao 00 AO AO 1 


yi 1 


TVk'O 


rNivi uuo«)«3i 


A^{\77AS^ 


70Q7 


r^-AnH QO HQ 

U-UUol o^-Uo 


OAA loll oO 1 oAOo 1 U 1 1 V7 




TVRO'^ 
1 irsxJO 














1 T r\V^O 


INIVI UU0^90 




7*^0-1 


U-UUol OO-UO 


oO 1 AoAAOo 1 vj 1 OUOA 1 1 1 


A4 0 
yi 0 


1 TKUo 


INIVI UUDZyo 




f OU 1 


U-UUol oo-Ud 


AOCjO 1 (jAOA 1 1 1 AOAAO 1 A 


914 


1 YKvJo 


iNM UUDziyo 


^ oy fyjif 


/ OUT 


U-UUol 00-U7 


GGATGGCTCC 1 1 1 GTGAAA 


915 




MM nnR9Q'^ 




1 OVj 1 


LJ-UUO 1 OO-UO 


OMoMooMMO 1 MOvjAAvjA 1 O 


yi D 


YES1 














YES1 


NM 005433 


21071041 


7525 


D-0031 84-05 


GAAGGACCCTGATGAAAGA 


917 


YES1 


NM 005433 


21071041 


7525 


D-0031 84-06 


TAAGAAGGGTGAAAGATTT 


918 


YES1 


NM_ 005433 


21071041 


7525 


D-0031 84-07 


TCAAGAAGCTCAGATAATG 


919 


YES1 


NM 005433 


21071041 


7525 


D-0031 84-08 


CAGAATCCCTCCATGAATT 


920, 



Table VIII 

5 



Gene 
Name 


Acc# 


Gl 


Locus 
Link 


Duplex # 


Full Sequence 


SEQ. ID 
NO. 


APC2 














APC2 


NM 013366 


7549800 


29882 


D-003200-05 


GCAAGGACCTCTTCATCAA 


921 


APC2 


NM 013366 


7549800 


29882 


D-003200-06 


GAGAAGAAGTCCACACTAT 


922 


APC2 


NM 013366 


7549800 


29882 


D-003200-07 


GGAATGCCATCTCCCAATG 


923 


APC2 


NM 013366 


7549800 


29882 


D-003200-09 


CAACACGTGTGACATCATC 


924 


ATM 














ATM 


NM 000051 


20336202 


472 


D-003201-05 


GCAAGCAGCTGAAACAAAT 


925 


ATM 


NM 000051 


20336202 


472 


D-003201-06 


GAAIGI IGCI 1 ICIGAAI 1 


926 


ATM 


NM 000051 


20336202 


472 


D-003201-07 


GACCTGAAGTCTTATTTAA 


927 


ATM 


NM 000051 


20336202 


472 


D-003201-08 


AGACAGAATTCCCAAATAA 


928 


ATR 














ATR 


NM 001184 


20143978 


545 


D-003202-05 


GAACAACAC I GC 1 GG II 1 G 


929 


ATR 


NM 001184 


20143978 


545 


D-003202-06 


GAAGTCATCTGTTCATTAT 


930 


ATR 


NM 001184 


20143978 


545 


D-003202-07 


GAAATAAGGTAGACTCAAT 


931 


ATR 


NM 001184 


20143978 


545 


D-003202-08 


CAACATAAATCCAAGAAGA 


932 


BTAK - 














BTAK 


NM 003600 


3213196 


6790 


D-003545-04 


C7W\GAATCAGCTAGCAAA 


933 


BTAK 


NM 003600 


3213196 


6790 


D-003203-05 


GAAGAGAGTTATTCATAGA 


934 


BTAK 


NM 003600 


3213196 


6790 


D-003203-07 


CAAATGCCCTGTCTTACTG 


935 


STK6 


NM 003600 


3213196 


6790 


D-003203-09 


TCTCGTGACTCAGCAAATT 


936 


CCNA1 














CCNA1 


NM 003914 


16306528 


8900 


D-003204-05 


GAACCTGGCTAAGTACGTA 


937 


CCNA1 


NM 003914 


16306528 


8900 


D-003204-06 


GCAGATCCATTCTTGAAAT 


938 


CCNA1 


NM 003914 


16306528 


8900 


D-003204-07 


TCACAAGAATCAGGTGTTA 


939 


CCNA1 


NM 003914 


16306528 


8900 


D-003204-08 


CATAAAGCGTACCTTGATA 


940 


CCNA2 














CCNA2 


NM 001237 


16950653 


890 


D-003205-05 


GCTGTGAACTACATTGATA 


941 


CCNA2 


NM 001237 


16950653 


890 


D-003205-06 


GATGATACCTACACCAAGA 


942 


CCNA2 


NM 001237 


16950653 


890 


D-003205-07 


GGTGTTAGCCTCAAAGTTT 


943 


CCNA2 


NM 001237 


16950653 


890 


D-003205-08 


AAGCTGGCCTGAATCATTA 


944 


CCNB1 














CCNB1 


NM 031966 


14327895 


891 


D-003206-05 


CAACATTACCTGTCATATA 


945 


CCNB1 


NM 031966 


14327895 


891 


D-003206-06 


CCAAATACCTGATGGAACT 


946 


CCNB1 


NM 031966 


14327895 


891 


D-003206-07 


GAAATGTACCCTCCAGAAA 


947 



114 





rMivi uonyoD 


i4oz/oyo 


oyi 


U-UUoZUD-Uo 


f^r^ Ar^r^Tf^r^r^T /\. a/^ a Anr/^"r a 
oUAUU 1 ovjU I AACjAA I 0 I a 


QA Q 

y4o 


















nllVI UU'f/Ul 


1 OQ'^ftOi "7 

1 uyooun / 


yi 00 


r\ no*^oo7 oe; 
U-UUoZU I -UO 


A A ^ A A ATOT/^ AAP'AAAr^A 


y4y 


PPMR9 


MM 004701 
INIVI UU*t f U 1 


1 uyoou 1 / 


y 1 00 


n 00*^007 OR 

U-UUo^U f -UD 


Vjl^MVj\-/AAAO 1 00 1 V3AA0A 1 


oco 

you 




MM 00A701 


i OQ'^ROI 7 

1 uyoou 1 1 


y 1 00 


n 00^^007 07 

U-UUO^U f -U f 


cr^Af^Tr^ ATTT r*r* Ar* a atat 

O^Ao 1 oA 1 1 1 oVjAoAA I A 1 


QCi 

yoi 




M^yl on^TOi 

INIVI UU*f#Ul 


luyooui I 


Qi 1Q 

y 100 


r\ noQoo7 OQ 
U-UUoZU / -Uo 


I^T"/^ A/^X A/^/^TT A A /^/^ AT AT 

1 VjAU I AL/V3 1 1 AA(joA 1 A 1 


ACO 

yoz 


















INIVI UooUol 


A A~7A QA A Q 

14/ iy4 ly 


OCA A 7 
0041 / 


iJ-UUoZUo-UO 


T/^ A A P* A A A P'TP* ^T/^ A /"^TT 1 

1 OAAUAAAU 1 1 (jAU I 1 1 


ACO 

953 




INIVI UJoUoT 


14/1 y4i y 


0041 / 


U-UUoziUo-Uo 


O /^T A P^ P*TP* P*TP* P* /^TP* P'TT A 

ov_/ 1 AV:jU I 1 oUU 1 1 1 A 


ACVl 

yo4 


PPKID1 


INM U%5oUol 


14/1 y4i y 


OCA A 7 
0041 / 


Pk nnooHQ n7 
U-UUozUo-U/ 


P* A A P^TP^ A /~*/^"rP'P*TP*T/^P' AT 

UAAU 1 UAUU 1 UCj I 1 CjCjA I 


ACC 

955 




INM UooUo 1 


i4/i y4i y 


QIZ.A A 7 
004l / 


U-UUo2Uo-Uo 


P*"rP*P* AT/^TP*T AP*P*T A ATP* A 

o 1 ooA 1 U 1 U r AuU 1 AATGA 


nee 

956 


















Mhii noc'i on 
INM UUDI9U 


/ oo^4oO 


oy^ 


Pk HAOOAQ AC 

u-uuo^uy-uo 


P^P* AO AP*P*TP*P*P* AP*T ATTTP* 


AC^ 

957 




INM UUDiyU 


/ o<>^4oo 


tjyii 


AAOOAA AC 

u-uuo^uy-ub 


P*0 A OT A P*'r'l "1 P* A A ATAOA A 

LjoAo I Avj 1 1 1 UAAA 1 AOAA 


ACO 

958 


pr*MP 


INM uuoiyu 


/ oo-^4oO 


oy^ 


Pk AAOOAA A7 

u-uuozuy-u / 


(jAUU I 1 1 (jU I UUACj I a I o 1 


AC A 

959 




INM uuoiyu 


f 00^400 


oyz 


r\ AAOOAA AQ 

u-uuozuy-uo 


P^ A/^ ATTP*T ATP*P*P* AP*P*TAT 
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oUAUAUAoAOAAA 1 CjAA 1 1 


4 OCA 

lo&y 


I r DO 


INIVI UUO/Z^ 


c^ylQQi 7 
010*4"0ol f 




U-UUoooU-Uo 


P*^ AP* AP*XP*XXP*XAP» A A XXX 

Lrvj AUAo 1 U 1 1 (j 1 AUAA 1 i 1 


•4 OCA 


1 r / o 














1 r / O 


INIVI UUO'l-Z/ 


**OOOD^4 




PI nriQQoi AC 
U-UUoool -Ud 


P^P'A AP*/^A/^/^/^i^ AX/^ A A /^/^ A 

CjUAAvjjOACjOOUA I OAAC^oA 


1 361 


1 n r 0 


INM UUO^^f 


ASlSiCZCiA A 


71 *51 
( ID 1 


Pi nrnQQi 
U-UUoooI-Ud 


VjAvjAUCjAvjCjAOAUo 1 AU 1 A 


1362 


1 r^/ O 


M^^ nr\«^yi9T 

INIVI-UUD'fZ / 


AfiflczCiAA 
4<5oOD44^ 


T1 *51 


U-UUooo1-U/ 


P^XP'P* AP* A A/^/^X/^ A r^r^ All/-* 

Lf 1 (jV-/AV3AAOO 1 CpAOOA 1 1 


H oco 

1363 


TP73 


MM 00*^497 


4.ftftRR44 


71 R1 


n-on'^'^'^1 -Oft 


00\-/\-fM 1 OOV^ 1 O 1 1 1 r\\^r\r\\D 


1 ODh- 


YWHAZ 














YWHAZ 


NM 003406 


21735623 


7534 


D-003332-05 


GCAAGGAGCTGAATTATCC 


1365 


YWHAZ 


NM 003406 


21735623 


7534 


D-003332-06 


TAAGAGATATCTGCAATGA 


1366 


YWHAZ 


NM 003406 


21735623 


7534 


D-003332-07 


GACGGAAGGTGCTGAGAAA 


1367 


YWHAZ 


NM 003406 


21735623 


7534 


D-003332-08 


AGAGCAAAGTCTTCTATTT 


1368 



Table IX 



Gene 
Name 


Accession # 


Gl# 


Duplex # 


Sequence 


SEQ. ID 
NO. 


AR 


NM 000044 


21322251 


D-003400-01 


GGAACTCGATGGTATCATT 


1369 


AR 


NM 000044 


21322251 


D-003400-02 


CAAGGGAGGTTACACCAAA 


1370 


AR 


NM 000044 


21322251 


D-003400-03 


TCAAGGAACTCGATCGTAT 


1371 


AR 


NM 000044 


21322251 


D-003400-04 


GAAATGATTGCACTATTGA 


1372 














ESR1 


NM 000125 


4503602 


D-003401-01 


GAATGTGCCTGGCTAGAGA 


1373 


ESR1 


NM 000125 


4503602 


D-003401-02 


CATGAGAGCTGCCAACCTT 


1374 


"ESRI" 


NM 000125 


4503602 


D-003401-03 


AGAGAAAGATTGGCCAGTA 


1375 


ESR1 


NM 000125 


4503602 


D-003401-04 


CAAGGAGACTCGCTACTGT 


1376 














ESR2 


NM 001437 


10835012 


D-003402-01 


GAACATCTGCTCAACATGA 


1377 


ESR2 


NM 001437 


10835012 


D-003402-02 


GCACGGCTCCATATACATA 


1378 


ESR2 


NM 001437 


10835012 


D-003402-03 


CAAGAAGAI ICCCGGCI 1 1 


1379 


ESR2 


NM 001437 


10835012 


D-003402-04 


GGAAATGCGTAGAAGGAAT 


1380 














ESRRA 


NM 004451 


18860919 


D-003403-01 


GGCCTTCGCTGAGGACTTA 


1381 


ESRRA 


NM_004451 


18860919 


D-003403-02 


TGAATGCACTGGTGTCTCA 


1382 


ESRRA 


NM 004451 


18860919 


D-003403-03 


GCATTGAGCCTCTCTACAT 


1383 


ESRRA 


NM 004451 


18860919 


D-003403-04 


GCAGACAGCGGGCAAAGTG 


1384 














ESRRB 


NM 004452 


22035686 


D-003404-01 


TACCTGAGCTTACAAATTT 


1385 


ESRRB 


NM 004452 


22035686 


D-003404-02 


GCACTTCTATAGCGTCAAA 


1386 


ESRRB 


NM 004452 


22035686 


D-003404-03 


CAACTCCGATTCCATGTAG 


1387 


ESRRB 


NM 004452 


22035686 


D-003404-04 


GGACTCGCCACCCATGTTT 


1388 














ESRRG 


NM 001438 


4503604 


D-003405-01 


AAACAAAGATCGACACATT 


1389 
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INIVI UU 1 'fOo 


40U0DU4 


pv AA'JylAK AO 

U-UUo4UO-U2 


1 OAvjCjAAAO 1 o 1 a t oA 1 vjA 


i OAA 

1 oyu 


cor\r\o 


^J^/l nniA'^ft 


40U00U4 


U-UUO4U0-UO 


o AAoAUCrAo 1 OLrAAA 1 1 A\j 


A OAi 

1 oyi 




INIVI UU l*frOO 


40UOOU4 


r^-AAIylAC A/I 
U-UU04UD-U4 


AT/^ A Ar^r^r*^r^Tr*r^Ar*r* att a 
A 1 oAAoOoLr 1 oOAQ^oA 1 1 A 


i OAO 

1 oy2 














MMPzlA 


KIM ^^^A^^7 
INIVI UUUfOf 


Oi Ofii i OA 

c. 1 OD 1 1 o4 


r\ AA'i/lAf; Ai 
L/-UUo4UD-Ul 


UvjAUA 1 OAO 1 vjoAoLrA 1 A 1 


i OAO 

1 oyo 


MMPilA 


KIM 

INIVI UUU*tOi 


Oi oci i QA 
Z 1 OOl 1 o4 


n_AAQ/IAR AO 
U-UU*34UO-U^ 


oAAv3\jAAoOOV3 1 OOAoAA 1 


■i OAyl 

ioy4 


rif>ir*i/\ 


INIVI UUUhO/ 


zi oon lo4 


r^_AAQ>iAft AO 
U-UUo4UO-UO 


r^r^A A/^TA/^ AT/^/^O A/^/^TTT 

OUAAo 1 AUA 1 UOOAoU 1 1 1 


<i OAC 

1395 




INM UUU'l'Of 


Oi QAi i fiyi 
^ 1 OOl 1 o4 


pv AAO/IAfS f\A 

U-UUo4Uo-U4 




4 OAC 

1396 
















hAh^ f\r\A i 
INM UU41 Oo 


DOOl Uc5/ 


r\ AnQ/lA7 Ai 

U-UUo4U /-U 1 


A r^Tr^ a at a a a/^/^tt a a 
(jUAO 1 CjAOA 1 AAAUoTTAA 


<4 OA^ 

1397 




NM UU^H oo 


DDOl \JO( 


P\ AA'3ylA7 AO 

U-UU%54U/ -Uil 


a O a a a a ATO/^ AT/^ a TI^T 

AUAAACjALi A 1 UOA 1 CjA I Cj l 


A OAO 

1 398 




INM UU^lOO 


DOOIUO/ 


r\ nAO>1A7 AO 

U-UUo4Uf -Uo 


A /-^ A AT/^/^ AT/^ AT/-*T A T A A 

ACaACjA 1 ULrA 1 oAl (j I ATAA 


1 399 




Kihii nnyi'ioQ 
INM U041 oo 


ooolOo/ 


AAO>IA7 A>i 

Ll-00o407-04 


A A AT/^ A A/^/^T/^ A/^A^A ATTA 

AAATGAAC(3TGACAGAATA 


1400 
















Kl^^ Hi "7t^QO 
INM U1 fOoZ. 


cjyzo/ /o 


p\ AAOylAQ Ai 

U-UUo4Uo-U1 


A A T/^ A AT/^T A A 1 I I 

o AA 1 Cj AA 1 U 1 AL/AUU i T \(3 


4 >1 A'l 

1401 




KIKA Hi 

INM U1 /OoZ 


yy^io/^ /o 


r~\ AAO/1AO AO 

U-UUo4Uo-0^ 


A A A T A ~n/^ A A A /^T 

GGAAATACGTGGAGACACT 


A A AO 

1402 




NM Uj fOoZ 


QAOQ77^ 

tjyzo/ /O 


AAO>1AO AO 

U-U0o40o-Uo 


1^/^ A A T A A A /^^/-^/^/^ ATA 

OUAGA r AAOTACGGCGATA 


1403 


Lie A loAOR 


INM U1 fOoZ 


oyzo/ /O 


r\ AAOylAQ f\A 

U-UUo4Uo-04 


TGGCGTACCTTCTCATTGA 


A A/y A 

1404 














InKUD I 


NM UUU4/0 


ouiouoy 


AAO/IAA A4 

u-uuo4uy-ui 


A A T/^ ^ A T/^ ATA T^ A T/^ 

UAGCrA 1 GGATGATATGATG 


A A AC 

1405 


INKUd 1 


INM UUU4/0 


OUT oUoy 


r\ AAOylAA AO 

u-uuo4oy-02 


OT/^^X/^ A/^ A 1 1 AT/^ A AT/^ 

CTG CTG AG ATTC ATC AATG 


1406 


iNKUD I 


iNM UUU4/0 


KAi f^AQA 

ouioUoy 


Pi AAOilAA AO 

U-UUo4Uy-Uo 


A A A "r T O AT/^/*^ A A T "¥' A A 

AUAGAi rUATCGAACTTAA 


1407 


INKUd 1 


NM_UUU*i> 1 0 


KAi AAOA 

ouiouoy 


p\ AAO/IAA r\A 

U-UUo4Uy-U4 


A A /^^T/^i^/^/^/^T/^/^T/^T A 

GAAOo 1 GGC/GO 1 OO i G lAO 


A A 

1408 














MRono 


iNM UiiiyDy 


i QOCACAO 

1 ozbyouz 


r> AAO>liA Ai 

U-UUo4 10-01 


A AT AT/~*/^/">T/^/^/^T/^ AAA 

GAATATGCCTGCCTGAAAG 


1409 


KiDrkQO 
NKUd^ 


INM u^iyoy 


i OOCACAO 

1 ozoyou^ 


AAOyiiA AO 

U-UUo4 10-02 


/~* A A T A T/~* /~*/~*T/^/^/^"T"/~» AAA 

GGAATATGCCTGCCTGAAA 


1410 


NKUbz: 


NM u^iyoy 


1 O259502 


AAOil'f A AO 

D-00o4 10-03 


CGTAGCCGCTGCCTATGTA 


1411 




NM u^iyoy 


i oococoo 

1 0^59502 


AAO A A r\ r\ a 

D-00o41 0-04 


GCCATTCTCTACGCACTTC 


1412 
















NM U/i1 / ^4 


i Qyl QAQ/1 7 

1 o4oUo4 / 


n\ AAO>lii Ai 

U-UUo41 1-01 


CAACACAGGTGGCGTCATCTT 


1413 


INK! U1 


NM UZl Z4 


i lA OAQ/I 7 

1 o4oUc>4 / 


AAOylii AO 

U-UUo41 1-02 


/^/"^/^ AT^/^T^ 1 1 A /^T^T/^T ATT 

GGOA T G GTGTTACTGTGTATT 


1414 


INK 1 U 1 


NM UiCi / Z4 


1 o4oUo4 I 


AAOyii i AO 
U-UUo41 1-Uo 


^ A A/^ AT/^/^ ATTO/^/^ AO A A/^TT 

CAACATGCATTCCGAGAAGTT 


A A A C 

1415 


INK! Ul 


NM U^ If 2:4 


1 o4oUo4 ( 


r\ AAO>lii A>l 

U-00o41 1 -04 


GCGC 1- IM GCTTCGTTGTTCTT 


A A A ^ 

1416 














INK 1 


KIKil AA7'10'I 
NM UUfl^l 


i i 00<i AOA 

1 io2io2y 


r\ AAO^HO A-i 

D-00 34 12-01 


A A A A T/^ A A A A A 

GAACAGATCCGGAAGAAGA 


141 7 


NKl 


NM UU/IZl 


i i QOi cork 

1 1 ozl bzy 


f— \ AAOyl'IO AO 

U-00>541 2-02 


A A A A A A T/^/^/^ O A A A 

GAAGAACAGATCCGGAAGA 


1418 


KIDi l-IO 

INK! 


NM UU/1^1 


i i ooi coo 

1 lo^io^y 


f— \ AAO>I4 0 AO 

D-00o4 12-03 


/^T A A /^'/** A A /^T/^/^/^T/^y^ 111' 

CTAAGCAAGTGCCTGG 1 1 I 


1419 


KIDi UIO 


NM UUf 1^1 


i i OO-i fSOA 

1 io^io2y 


r\ AAOyliO f\A 

U-00o41 2-04 


O/^T A A/^ A/^/^O/^/^T/^ A AO A A 

GCTAACAGCGGCTCAAGAA 


1420 














KIDi UQ 

INK! no 


NM uuooyo 


CAOi QAO 

oUol oy^ 


r-\ AAO>iiO Ai 

U-00o41 0-01 


O A A O A O ATOOOOOTO A A A 

GAACAGATCCGCCTGAAGA 


1421 


KIDi Ul'5 

INKl no 


NM UUDoyo 


CAOi QAO 

ouol oy^ 


AAOyii O AO 

L)-UUo41 O-02 


oo A O AT A OTTO A OTT — TOOT 

GGAGATAGTTGAC 1 1 1 GCT 


1422 


Kl D i Ul Q 

INK! no 


NM uuooyo 


CAOi QAO 

bUoloy^ 


r\ AAOyii O AO 

U-U0o41 o-Oo 


O A OTTTOOOTTOOTO AT — i~0 

GAG I I 1 GCCTTG CTCATTG 


A A OO 

1423 


INK 1 no 


NM UUDOyj 


ouoi oyz 


pi AAOyl i O r\A 
U-UUo4 1 0-U4 


TOAOI 1 1 OOTA AAO AOOT A 

1 GALr 1 1 1 GO 1 AAAOAGOTA 


4 y| Oyl 

1424 














MRi MA 
INK! n*!- 


NM UUOi^o 


ii QOf?Q7A 

4o^Dy /y 


P\ AAQyl i >l Ai 

U-UUo414-U1 


O A A OTO A /^r^Tr^f^ A O A A O A A 

OAAG 1 GAOO 1 OGAOAAOAA 


A A^C 

1425 


INK! n*!' 


NM UUOI^O 


>i QOAQ7Q 

4o^oy f y 


pv AAOyl 'I y1 AO 

U-UUo4 i4-U2 


O A A A O A AT — TOO A A A T A OTO 

GAAAG AA i 1 OGAAA 1 AG 1 G 


A A o/:? 

1426 


INK 1 n*f 


INM UUO 1 ZO 


^o^oy / y 


pk AAO/I i A AO 
U-UU041 4-UO 


O A A O A O A OTOTTOT A O A TT 

OAAOAGAO 1 O i 1 O 1 AOA 1 1 


i yi 0"7 

1427 


INK 1 n*t 


NM UUDT^o 


4o^oy /y 


p\ AAOyl i >i Ayl 

U-UUo414-U4 


O A AOO AT A OTOOO A AT AO A 

GAAOOA 1 AO 1 OGOAA 1 AOA 


4 >i oo 

1428 














MR1 19 
INK 1 \£, 


KIM nn'^ftfto 
INM uuoooy 


1 1 OOo 1 oo 


r\ AAOyl IK Ai 
U-UU04 1 O-U 1 


O A AOO A TO OTO AO 1 1 1 OT A 

GAAOOA 1 GO 1 GAO 1 1 1 G 1 A 


A A OA 

1429 


NR1I2 


NM 003889 


1 18631'^'^ 




a AT C^C^ /^CC^OTCAC^ AT AAA 


1 HOU 


NR1I2 


NM 003889 


11863133 


D-0034 15-03 


CAACCTACATGTTCAAAGG 


1431 


NR1I2 


NM 003889 


11863133 


D-003415-04 


CAGGAGCAATTCGCCATTA 


1432 














NR1I3 


NM 005122 


4826660 


D-003416-01 


GGAAATCTGTCACATCGTA 


1433 


NR1I3 


NM 005122 


4826660 


D-003416-02 


TCGCAGACATCAACACTTT 


1434 
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NIR1 1'^ 
INrx 1 to 


KIM nn*%i99 


^OZODOU 


n nn'^Ai r n^^ 

U-UU04 1 D-UO 


K^K^ IK^I \ y^KjK^ 1 AOAOMM 1 1 O 


\ 400 




KIM nn'%199 

INIVI UUO 1 


HOZOOwU 


n.nn'^Ai R-nA 

U-UU04 1 D-U4 


A A r* A l^TTTriTrsr* A/^TTT A 
1 1 1 1 V30M\J 1 1 1 M 


i A'^R 
1 40D 














MR9P1 


MM 00*^907 

iNlVi KJU^^yPf 


'+OU t \j 1 ^ 


n nn'^Ai 7 ni 

Lz-UUO^ 1 /-U 1 


Ar* A/^r* Ar^XTTi AT/^ ATA A 


i A ft7 
1 40 f 


MR9P1 


MM 00*^907 


4^n7R79 


n nn'^Ai 7 n9 

U-UU04 1 r-UZ 


vjoAAooAAo 1 vj I AOAL^O 1 A 


i A 'ift 
1 400 




MM 00*^907 


A'^n7fi79 
H OU f O / z 


n nn'^Ai 7 n*^ 

LJ-UU04 1 f -UO 


oAIjOAOA I O I I V-»AAAk^ 1 AV^ 


i A QQ 

1 4oy 




MM 00^^907 
INIVI UVJO^^ r 


4'?n7ft79 


n_nn'^Ai7 nA 

LJ-UU04 1 f -U4 


A A A A ATTf^r^ A r* AXr* A a a 
OAAOAAA 1 1 OV-/AOA 1 l^AAA 


i AACi 
1 44U 














MR9P9 




/lcn7R7il 
^OU f o / ^ 


LJ-UU04 1 o-U 1 


rxAAr^A Ar^nn~rr^ Ar^ A r^~\ — rr* a 

OAAOAAV_/VJO 1 VjAOAL-r 1 1 L/A 


A AA'i 


NR9P9 


INIVI uuozyo 


/I cn7ft7A 


n nn'^A a ft no 

LJ-UU04 1 O-UZ 


r^xc:. ATHi Anr^Tr^r^ A Ar* axa a 
O 1 oA 1 oAoL/ 1 L/V^AAL/A 1 AA 


144Z 


MR9P9 


MM nn'^9Qft 

INIVI UUO^b^O 


4*^07^7/1 
*tOU # D # *f 


L/-UU04 1 O-UO 


A A r^~r A A n~r a ax/^xxx/^ 
V/AAOO 1 AAo 1 AA 1 L/ 1 1 1 o 


'i AA^ 

1 44o 


MR9P9 


MM an^kOQSi 

INIVI UUo^^O 


A*^n7^^7A 

^OUf Of *t 


n nn'^Aift c\a 

U-UU041 0-U4 


<^ A A A A X A A XX 

oAAoAOAoO 1 AOL-^oA 1 i oVj 


^AAA 

1444 
















MM nn'^9^^Q 

INIVI UU^^ZDk7 


9i Qfti i no 

Z 1 OD 1 1 UO 


n nnQyiio ni 
L/-UU041 y-ui 


AX/^ AX AX/^X/^ AAAXAr^A/^ 

oA 1 OA I A 1 O 1 oAAA 1 AUAlj 


1440 


MR9P1 


MM nn'^9RQ 
INIVI UUOZOv7 


9i '^Ri i nft 

Z 1 OO 1 1 UO 


n nn'^Aio n9 
L'-UU04 1 y-uz 


A A A r*xr^ r*xxxr* A n A X A X 

V^AAoAV^ 1 oV-^ 1 1 1 OAoA 1 A 1 


^AAd 

i44D 




MM 00*^9^0 
INIVI UUO^O%7 


Z 1 OO 1 1 Up 


n nn'^Ai o 
U-UU04 1 y-uo 


<^ XX A A x<^ r^x A r*x A A XXX 
o 1 1 AoA 1 yjy^ 1 AO 1 oAA 1 1 1 


^ AA7 

144/ 


MR9P1 


MM nn'^9^?Q 

INIVI UUOZOc^ 


91 '^Ri i nft 
Z1 OD 1 1 UO 


ri_nnftyiiQ n>i 
L/-UU041 y-U4 


r* A AX/^XAXr^X/^X AXr* A A^X 

oAA 1 V3 1 A 1 o 1 O 1 A 1 o AAo 1 


144o 
















MM niA94Q 

INIVI u 1 'fZ'fy 


7R'^7'^QA 

f DO / Ov7H- 


LJ-UU04ZU-U1 


n An A A rm~m r^~r~t~m~r ax a 
oAoAAoO 1 OO 1 1 1 o 1 oA 1 A 


A AQ 

144y 


MR9P'^ 


MM niA9AQ 


7R*^7'^QA 

I DO 1 oy*fr 


n nnft/ion no 
U-UU04ZU-UZ 


n A Anr^ a/^x Axor^r^ ax/^xax 
vjAAoOAO 1 A I ooOA 1 O 1 A 1 


14oU 




MK^ niA94Q 

INIVI u 1 'fZ'fy 


/ DOr Oy*!- 


L/-UU04ZU-UO 


oAAvjOA 1 OO 1 VjAoOAOvj 1 A 


i401 


MR9P'^ 


MM niA9AQ 
INIVI U l^f-ZM-y 


7R(^7'^QA 

r DO / oy'f 


p\ nnft>!on_nii 
U-UU04ZU-U4 


n A A ^/^X/^I^XXX/^Xr* AX AX/^ 

o AAIjO 1 OO 1 1 1 o 1 oA 1 A 1 o 


•1 A CO 

140Z 














MR9P1 


INIVI UUODO*r 


Ofii 9 7 Aft A 
ZUl Z f '•■o*f 


r\_nnftA9i _i^i 

L/-UU04Z 1 -U 1 


nA A A r^T r*~rr^ AT r^r^r^ t^r^ ax a 
oAAAO 1 O 1 OA 1 OOoOoA 1 A 


■i A C3 

14oo 


MR9P1 


INIVI UUODO*!- 


9ni 07>1Q/I 

zun z / 'fro'l- 


L/-UU04Z 1 -UZ 


TnTC^ ATf^r^r^nr^ ax axotx a 
1 O 1 OA 1 OOoOoA 1 A 1 O 1 1 A 


14o4 


MR9P1 


INIVI UUODO*f 


Oni 07AQ.A 
ZU iZ / ^o^f 


n nn':i>ioi ni 
U-UUo4Zl -Uo 


n A Ar^ A Ar^~rr^nnTr^ A a a/^x/^ 
OAAoAAo 1 oOO 1 OAAAo 1 o 


A ACC 

14oo 




MM nnf^ARA 

INIVI UU0004 


Oni 07ylQi4 
ZUl Z f 'fo'fr 


r»_nn'i>ioi f\A 
U-UU04Z 1 -U4 


r^i^ A Af^ 1 1 A A/^XXA/^ A/^ AX^^ 

oVjAAO 1 1 AAO 1 1 AOAOA 1 o 


1400 














MR9P9 


INM UZIUUO 


i /1 1 /1Q7/1 

n 41 4y / 40 


U-UU04ZZ-U1 


r^~r f\r^r^^r^T r^r^ n ax ax a xt 
o 1 AOO 1 \3 \ OOoVjA 1 A 1 a 1 1 


A A C"7 

1457 


MR9P9 
IN rxZ r Z 


INIVI 1 UUD 


1 /I i /1Q7/1 K 

14 I'l-y /40 


p\ nnQyioo no 
IJ-UU04ZZ-UZ 


nn A AnnAnnnnAnr^An axx 
OOAAOOAvjOOoAOoAOA 1 1 


A A CZQ 

14oo 


MR9P9 


MM n9inn';; 

INIVI UZ 1 UUO 


i il i ylQ7/l*^ 
I414y f 40 


Pi nnQyioo nQ 
U-UUo4ZZ-Uo 


A nTnnT a nn~m~mnnn ax a 
AO 1 Oo 1 AOO 1 vj I OOoQ^A 1 A 


A A cr\ 

i4oy 


MR9P9 


MM 09100*^ 
INIVI UZ 1 UUO 


14 i*i>y#40 


n CKCS'^AOO f\A 
L/-UU04ZZ-U4 


nnnn n~r AT A~mnn A a tx^ a 
ooOOo 1 A 1 A 1 ooOAA 1 1 OA 


A ACif\ 

14dU 














MR9PR 


INIVI UU0ZO4 


zuu/ uiyo 


n\ nni/ioQ ni 
U-UU04Z0-U 1 


OijAOoOO 1 vj 1 \jVjOO 1 O 1 OA 


A AC A 

14d1 




INIVI UUOZo4 


9nn7niQft 
zuuf unyo 


Pi nn*a>ioQ no 
LI-UUo4Zo-UZ 


n AnnnnnTnTnnn a i\nTn a 
OAoOOOvj 1 O 1 OOoAAO 1 OA 


14bz 


MR9PR 


INIVI UUOZoH- 


zuu/^ unyo 


pi riA'i/io'i nQ 
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1561 


VDR 


NM 000376 


4507882 


D-003448-02 


GCAAGGAAGACTACAAGTA 


1562 


VDR 


NM 000376 


4507882 


D-003448-03 


TCAATGCTATGACCTGTGA 


1563 


VDR 


NM 000376 


4507882 


D-003448-04 


CCATTGAGGTCATCATGTT 


1564 



Table X 
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Symbol 






ABCB1 


GACCAUAAAUGUAAGGUUU 


1565 




UAGAAGAUCUGAUGUCAAA 


1566 




GAAAUGUUCACUUCAGUUA 


1567 




GAAGAUCGCUACUGAAGCA 


1568 



ABCC1 



Sense 




GGAAGCAACUGCAGAGACA 


1569 


GAUGACACCUCUCAACAAA 


1570 


UAAAGUUGCUCAUCAAGUU 


1571 


CAACGAG UCUGCCG AAGGA 


1572 



ABCG2 



Sense. : 




GCAGAUGCCUUCUUCGUUA 


1573 


AGGCAAAUCUUCGUUAUUA 


1574 


GGGAAGAAAUCUGGUGUAA 


1575 


UGACUCAUCCCAACAUUUA 


1576 



KCNH2 



. .I.' .. l2- Sense*,. ^ 




CCGACGUGCUGCCUGAGUA 


1577 


GAGAAGAGCAGGGACACUU 


1578 


GAUCAUAGCAGGUAAGAUA 


1579 


GCUAUUUACUGCUCUUAUU 


1580 


UCACUGGGCUCCUUUAAUU 


1581 


GUGCGAGCCUUCUGAAUAU 


1582 


GCUAAGCUAUACUACUGUA 


1583 


UGACGGCGCUCUACUUCAC 


1584 



KCNH1 



Sense -1 m '^t 


■ . H 






GAGAUGAAUUCCUUUGAAA 


1585 




GAAGAACGCAUGAAACGAA 


1586 




G AU AAAG AGACG AU UG AAA 


1587 




GCUGAGAGGUCUAUUUAAA 


1588 





CLCA1 



SLC6A1 



SLC6A2 



SLC21A2 



-S'? - m Sense- T- • 'M? - 




GAACAACAAUGGCUAUGAA 


1589 


GUACAUAGCUGGCUGGAUU 


1590 


GAACAGCUCACAAGUAUAU 


1591 


GGAAACGUGUGUCUAUAUU 


1592 




-p- 'f-' ■Siens-e\* ■% ■ :f 


1:^ .A 


GGAGGUGGGAGGACAGUUA 


1593 


UCACAGCCCUGGUGGAUGA 


1594 


GAAGCUGGCUCCUAUGUUC 


1595 


GGUCAACACUACCAACAUG 


1596 




Sense 




GAACACAAGGUCAACAUUG 


1597 


AGAAGGAGCUGGCCUAGUG 


1598 


CGGAAACUCUUCACAUUUG 


1599 


CAACAAAUUUGACAACAAC 


1600 




Sen^ii?,> ■ 




GUACAUCUCCAUCUUAUUU 


1601 


GGAAGUGGCUGAGUUAUUA 


1602 
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SLC21A3 



SLC28A1 



SLC29A1 



SLC26A1 



SLC26A2 



SLC4A4 



GLRA1 



GAAGGGAGGCUCAAUGUAA 


1603 


GAAGGAAGUGGCUGAGUUA 


1604 




Sense 




G U AG AAAC AGG AGC U AU U A 


1605 


CAAGAUUACUGUCAAACAA 


1606 


GCACAAGAGUAUUUGGUAA 


1607 


GCAAAUGUCCCUUCUGUAU 


1608 


GCAUGACUCCUAUAUAAUA 


1609 


AAACAGCAAUUUCCCUUAA 


1610 


GAAAAUGCCUCUUCAGGAA 


1611 








GUUCAUCGCUCUCCUCUUU 


1612 


GGAUCAAGCUGUUUCUGAA 


1613 


GGACUGCAGUUUGUACUUG 


1614 


GAGUGAAACUGACCUAUGG 


1615 




i.- ik>.. „;Ji3ense.'^^ rM:.^^-iM 




GAACGCUGCUCCGGUGGAA 


1616 


GAAAGCCACUCUAUCAAAG 


1617 


GAAACCAGGUGCCUUCAGA 


1618 


CCUCACAGCUGUAUUCAUG 


1619 






CCACGGAGCUGCUGGUCAU 


1620 


GGGUUGACAUCUUAUUUGA 


1621 


GCACGAGGGUCUCUGUGUU 


1622 


GGCCAUCGCCUACUCAUUG 


1623 


CAACACCCAUGGCAAUUAA 


1624 


GAGGAAAGAUCUUGCUGAU 


1625 


GAGCAAGCGUCCUCCAAAU 


1626 


GCAACACCCAUGGCAAUUA 


1627 






CCAAAGAACUCAAUGAACA 


1628 


ACAAGAACCUUCAGACUAA 


1629 


GAAGGUAGAUAGAAGAAUG 


1630 


GUAUUGAACUGUACUGUAA 


1631 




1% - Sense ^T. „f<^fe^,-. ^ ^ 




GCAAUUCUCUUCAUUUAUC 


1632 


GGAAAGAUGUCCACUGAAA 


1633 


GGACAAAGCCUUCUUCAAU 


1634 


GGAAUGGGAUCCAGCAAUU 


1635 




Sensed'.. 




UGAAAGCGAUUGACAUUUG 


1636 


CAGACACGCUGGAGUUUAA 


1637 


CAAUAGCGCUUUCUGGUUU 


1638 


1 GCAGGUAGCAGAUGGACUA 


1639 



KLK1 



^: ■ Sensejte; 




UCAGAGUGCUGUCUUAUGU 


1640 


CAACUUGUUUGACGACGAA 


1641 


UGACAGAGCCUGCUGAUAC 


1642 



130 



AGGCGGCUCUGUACCAUUU 



1643 



ADAM2 



XPNPE 
PI 



Sense 




GAAACAUGCUGUGAUAUUG 


1644 


GCAGAUGUUUCCUUAUAUA 


1645 


CAACAGAGAUGCCAUGAUA 


1646 


GAAAGGCGCUACAUUGAGA 


1647 




Sense ..r 




G ACCUGAGCU UCCCAACAA 


1648 


GCGACUGGCUCAACAAUUA 


1649 


GAGAUUGCGUGGCUAUUUA 


1650 


GACAGCAACUGGACACUUA 


1651 



GZMA 





1 '^^^^^■^ - ■■ 


GGAAGAGACUCGUGCAAUG 


\ 1652 


GGAACGAUGUGCCAAGUUG 


1653 


GAAGUAACUCCUCAUUCAA 


1654 


GAACUCCUAUAGAUUUCUG *' 


1 1655 



CMKLR1 





11 '-5 :m 


CAU AG AAGCU U U ACC AAG A 


1656 


GAAUGGAGGAUGAAGAUUA 


1657 


GGUCAAUGCUCUAAGUGAA 


1658 


GAGAGGACUUCUAUGAAUG 


1659 



CLN3 



CALCR 



OXTR 



EDG4 



EDG5 



m^"^ 3^^3^ -^'^R? ^ M 




CAUCAUGCCUUCUGAAUAA 


1660 


CAACAGCUCAUCACGAUUU 


1661 


GCAACAACUUCUCUUAUGU 


1662 


GGUCUUCGCUAGCAUCUCA 


1663 




^ , . Sense . 




GGACCUAGCUGUUGUAAAG 


1664 


G AAAG ACCAUGC AU U U AAA 


1665 


GCAGGAAGAUGUAUGCUUU 


1666 


GAAUAAACCAGUAbKSGUUA 


1667 




y"£' ^'t' Sense ' " "^^^W'> ::M: 


GGACCCAGAUAUCCAAAUA 1668 


GCAAUACUAUCCUAACUGA 


1669 


GAAUAUAGAUUAGCGUUUG 


1670 


GAUGAGGCAUGACUACUAA 


1671 




iW "" Sense ' ^fg' : "^^W 




GCGAGUCUGUCCACUAUAC 


1672 


GAGAACGGCCACCCACUGA 


1673 


GAACGGCCACCCACUGAUG 


1674 


GGUCAAUGCUGCUGUGUAC 


1675 




Sense 




UCCAGG AAC ACU AU AAU U A 


1676 


GUGAGCAUCUUCUCCAUCA 


1677 


CAUCCUCUGUUGCGCCAUU 


1678 


CCAACAAGGUCCAGGAACA 


1679 



EDG7 



PTCH 







ACACUGAUACUGUCGAUGA 


1680 


AAUAGGAGCAACACUGAUA 


1681 


CAGCAGGAGUUACCUUGUU 


1682 


GGACACCCAUGAAGCUAAU 


1683 








GCACAGAACUCCACUCAAA 


1684 


GGACAGCAGUUCAUUGUUA 


1685 


GAGAAGAGGCUAUGUUUAA 


1686 


GGACAAACUUCGACCCUUU 


1687 






,. II 


UCGCUACCCUGCUGUUAUU 


1688 


GCUACAAGAACUACCGAUA 


1689 


CAAGAAAGCUUCCUUCAAC 


1690 


GAGAAGAAAUACAGUCAAU 


1691 






CAAUAUAUCUGAAGAGCUA 1692 


GAACUGGACUGUGGCAUUG 


1693 


GUGAGAAGAUGGUAUAUUU 


1694 


GAGGGUACUUUAAGACAUA 


1695 








CAUGAGGUGUCAACUGUUA 


1696 


GAAGUGAAAUGCUUUAAUG 


1697 


AAAUAUGGCUCCUCCUUAG 


1698 


GCAAUCACAUUUAUGCAUA 


1699 


CAACAUAACUGAGGUGGAU 


1700 


CAUGGUACAUUCAAGAUUU 


1701 








GAACUCUACUUCAGUCAAU 


1704 


GGGCAAAUGCAUCAUAAUA 


1703 


CAACAGAGGGAGUUUAAUA 


1704 


GAACAAAGCCACUGACXUJ3A 


1705 








GAAGUGAACUAUGAAGUAA 


1706 


CAACAAGGAUGACAAGAAA 


. 1707 


GGACAAAGUUUACCAAAUG 


1708 


GAGGGUCGAUCAUCUAUUA 


1709 


GAAUAUAGAGGGCUUAUGA 


1710 


CAACGACUAUGAAGAAUUC 


1711 


GAAGUGAGCAGAUCAGAAU 


1712 


GAGGAAAUCUCCAAAUGCA 


1713 




II ©gODg© II 1 


CCAGGCAGCUGAUCAUAGA 


1714 


UCUCAGGUGUUGCCAAAUA 


1715 


GAACAGCUGUAAUCUAUGA 


1716 


CCACUGGUCUGUAGGGAUU 


1717 



DVL2 



PTEN 



PDK1 



PDK2 



PPP2CA 



CTNNA1 



CTNNA2 



HSPCA 



DCTN2 



UCGUAAAGCUGUUGAUAUC 


1718 


GAGGAGAUCUUUGAUGACA 


1719 


GUAAAGCUGUUGAUAUCGA 


1720 


GAUCGUAAAGCUGUUGAUA 


1721 




Sense 




AGACGAAGGUGAUUUACCA 


1722 


UGUGAGAGCUACCUAGUCA 


1723 


GAAGAAAUUUCAGAUGACA 


1724 


UAAUAGGCAUUUCCUCUUU 


1725 




Sense 




GUGAAGAUCUUGACCAAUG 


1726 


GAUCAGCAUACACAAAUUA 


1727 


GAAUGAACCUUCUGCAACA 


1728 


GGCGCUAUGUGUAUUAUUA 


1729 




: ' j;/- ■ Sense 




GUACAAAGCUGGUAUAUCC 


1730 


GAAAGACUCCCAGUGUAUA 


1731 


GGAAGUCCAUCUCAUCGAA 


1732 


CCAAAGACAUGACGACGUU 


1733 




" '~ Sense ;-:;^rF-'r:-"^^ 




GUAAAGAGGAGACUGAAUG 


1734 


GGUCUGUGAUGGUCCCUAA 


1735 


CAAAGAUGCCUACGACAUG 


1736 


GGGCGAUGCCUGAGGGUUA 


1737 




Sense . - ; i 




UCACACAAGUUUAUGGUUU 


1738 


CAACAGCCGUGACCACUUU 


1739 


UAACCAAGCUGCAAUCAUG 


1740 


GAACUUGACGAUACUCUAA 


1741 




Sense - . 




GAAGAGAGGUCGUUCUAAG 


1742 


AAGCAGAUGUQCAUGAUUA 


1743 


UCUAAUAACUGCAGUGUUU 


1744 


GUAAAGGGCCCUCUAAUAA 


1745 




Sense 




GAAAGAAUAUGCCCAAGUU 


1746 


GAAGAAGAAUGCCACAAUG 


1747 


GCAGGAAGAUUAUGAUGUG 


1748 


AAAGAAAGCCCAUGUACUA 


1749 




Sense 




GGGAAAGAGCUGCAUAUUA 


1750 


GCUUAGAACUCUUUACUGA 


1751 


UAUAAGAGCUUGACCAAUG 


1752 


GCAGAUAUCUCUAUGAUUG 


J 1753 




- SQhse 




CAACUCAUGUCCAAUACUG 


1754 


GGAAUGAGCCAGAUGUUUA 


1755 
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CD2 



BAD 



SMAC 



MAP3K5 



PVR 



ERBB2 



SOS1 



BRCA1 



GGAGACAGCUGUACGUUGU 


1756 


UCCAAGAGCUGACAACUGA 


1757 




1 ©sooas 1 


II 


GUAAGGAGAAGCAAUAUAA 


1758 


AAG AUG AGCU U U CC AUG U A 


1759 


GGACAUCUAUCUCAUCAUU 


1760 


GACAAGAGCCCACAGAGUA 


1761 






II 


GUACUUCCCUCAGGCCUAU 


1762 


GGUGUGCCUUGACUACGUA 


1763 


GUACUUCCCUCAGGCCUAU 


1764 


GGUCAGGUGCCUCGAGAUC 


1765 






1 . ... , ni 


CAGCGUAACUUCAUUCUUC 


1766 


UAACUUCAUUCUUCAGGUA 


1767 


CAGCUGCUCUUACCCAUUU 


1768 


GAUUGAAGCUAUUACUGAA . 


1769 


UAGAAGAGCUCCGUCAGAA 


1770 


CCACAUAUGCGUUGAUUGA 


1771 


GCGCAGGGCUCUCUACCUA 1772 






GAACAGCCUUCAAAUCAAA 


1773 


GAUGUUCUCUACUAUGUUA 


1774 


GCAAAUACUGGAAGGAUUA 


1775 


CAGGAAAGCUCGUAAUUUA 


1776 








CCACACGGCUGACCUCAUA 1777 


CAGCAGAAUUCCUCUUAUA 1778 


GCAGAAUUCCUCUUAUAAA 1779 


GAUCGGGAUUUAUUUCUAU 


1780 






1 1 


ugugggagcugaujgacuuu 


1781 


UCACAGAGAUCUUiGAAAGG 


1782 


UGGAAGAGAUCACAGGUUA 


1783 


GCUCAUCGCUCACAACCAA 


1784 






1 -mry ,1 


G AGC ACC ACU UCU AU G A U U 


1785 


CAAAGAAGCUGUUCAAUAU 


1786 


UGAAAGCCCUCCCUUAUUA 


1787 


GAAAUAGCAUGGAGAAGGA 


1788 






1 1 


CCAUACAGCUUCAUAAAUA 


1789 


GAAGAGAACUUAUCUAGUG 


1790 


GAAGUGGGCUCCAGUAUUA 


1791 


GCAAGAUGCUGAUUCAUUA 


1792 


GAAGUGGGCUCCAGUAUUA 


1793 


GAACGGACACUGAAAUAUU 


1794 


GCAGAUAGUUCUACCAGUA 


1795 
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CDKN1A 



CDKN1B 



SLC2A4 



NOS2A 



Sense 




GAACAAGGAGUCAGACAUU 


1796 


AAACUAGGCGGUUGAAUGA 


1797 


GAUGGAACUUCGACUUUGU 


1798 


GUAAACAGAUGGCACUUUG 


1799 




Sense 




GGAAUGGACAUCCUGUAUA 


1800 


GGAGAAAGAUGUCAAACGU 


1801 


GAAUGGACAUCCUGUAUAA 


1802 


GUAAACAGCUCGAAUUAAG 


1803 




hI; ^ r^-l' ' .SenSjB ^ :Mc- 




CAGAUAGGCUCCGAAGAUG 


1804 


AGACUCAGCUCCAGAAUAC 


1805 


GAUCGGUUCUUUCAUCUUC 


1806 


CAGGAUCGGUUCUUUCAUC 


1807 




'^r r: Sense ' • 


1 


CCAGAUAAGUGACAUAAGU 


1808 


UAAGUGACCUGCUUUGUAA 


1809 


GAAGAGAGAUUCCAUUGAA 


1810 


UGAAAGAGCUCAACAACAA 


1811 



FRAP1 



FKBP1A 



TNFRSF 
1A 



IL1R1 



IRAKI 



fi^^ Sense-, ■ 




GAGCAUGCCGUCAAUAAUA 


1812 


CAAGAGAACUCAUCAUAAG 


1813 


CCAAAGUGCUGCAGUACUA 


1814 


UAAGAAAGCUAUCCAGAUU 


1815 




■ '. Sense.,: 




GAAACAAGCCCUUUAAGUU 


1816 


GAAUUACUCUCCAAGUUGA 


1817 


CAGCACAAGUGGUAGGUUA 


1818 


GUUGAGGACUGAAUUACUC 


1819 


GAUGGCAGCUGUUUAAAUG 


1820 


GAGUAUCCUUUCAGUG.UUA 


1821 








CAAAGGAACCUACUUGUAC 1822 


GGAACCUACUUGUAGAAUG 


1823 


GAACCUACU UG UACAAUGA 


1824 


GAGUGUGUCUCCUGUAGUA 


1825 




i'M'. i t-'i^-^ Sense" ; - ^ 




GGACAAGAAUCAAUGGAUA 


1826 


GAACAAGCCUCCAGGAUUC 


1827 


GGACUUGUGUGCCCUUAUA 


1828 


GAACACAAAGGCACUAUAA 


1829 




Sense 




CGAAGAAAGUGAUGAAUUU 


1830 


GCUGUUUGCCCAUCUCUUU 


1831 


UGAAAGACCUGGUGGAAGA 


1832 


GCAAUUCAGUUUCUACAUC 


1833 
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GAAGACAGAGUUAUUAAAC 


1834 


UCACGAAGAGAGAGUUAUU 


1835 


AGACAGAGUUAUUAAACCA 


1836 


CAGGAAGAGAGAGUUAUUA 


1837 


GCUGAAGCCUGUCUGAUGU 


1838 



TRAF6 



Sense': - 


CAAAUGAUCUGAGGCAGUU 


1839 


GUUCAUAGUUUGAGCGUUA 


1840 


GGAGAAACCUGUUGUGAUU 


1841 


GGACAAAGUUGCUGAAAUC 


1842 


CAAAUGAUCUGAGGCAGUU 


1843 


GGAGAAACCUGUUGUGAUU 


1844 


GGACAAAGUUGCUGAAAUC 


1845 


GUUCAUAGUUUGAGCGUUA 


1846 







UGAAGCACCUUGAUCUUUG 


1847 


GGGCAGCGCAUACCUGUUU 


1848 


GAGGAGCGCUGUUUGAGUU 


1849 


GGACGAGGAGCGCUGUUUG 


1850 


GAGGAGCGCUGUUUGAGUU 


1851 


GGAUGUCUCUCUCCUCUUU 


1852 


GCUCACUCCUUUCUACUAA 


1853 


UGAAGCACCUUGAUCUUUG 


1854 



' $SSi^^m^>^^M^% Sense i 




GCACAGAUAUUUCCAUUUC 


1855 


GCAGUCCUCUUAUUCCUAA 


1856 


GAACUCAAGCUGCGUUUAU 


1857 


GGACGAAUUGAGAUAAUAU 


1858 







UAAGAACACUGCUCAUGAA 


1859 


GAGGCAUCCUGAAGCAUUA 


1860 


GAAGGCGGCUGCAGAACUG 


1861 


GGAACAAGGAGAUCAUGUA 


1862 



' - . - i Sense ' '*l«1%ffl 




CUAUCGAGGUCGUUAAAUU 


1863 


GAAUGCAGGUGGAAGAUCU 


1864 


GCGGCGAGCUGGACUGUUU 


1865 


CCAGACCGAUGUGUAUUUA 


1866 



TNFRSF 
5 











GGUCUCACCUCGCUAUGGU 


1867 


GAAAGCGAAUUCCUAGACA 


1868 


GCACAAACAAGACUGAUGU 


1869 


GAAGGGCACCUCAGAAACA 


1870 


UCUCCCAACUUGUAUUAAA 


1871 







UCAAGUGUCUUCCAUCAUG 


1872 


UCAAGUGCCUUAAUAGUAG 


1873 



GGAGUACCCUGAGGCUAUA I 


1874 


GAUGAGAUCUUCCUACUGU | 1875 




Sense 




GAGCUGGGCUAAGUAAAUA 


1876 


GACCAAAGAUGGAGUGAGA 


1877 


GGAAGAAACUGGUGAUUGU 


1878 


GGCUGUAACUACUUUAUAA 


1879 




Sense 




GGACAUUUGUUUGCCAUUU 


1880 


GGAGAACCAUAUACUCUUG 


1881 


GAACCAAUGCUUUCUCAUG 


1882 


GAAGACCUGUUAUGUAGAG 


1883 


GAUCAAGAAUUGCAAUAUC 


1884 


GAAAAGGGGUGACCUAGUA 


1885 


UGACAAACCUUAUGGAAAA 


1886 




: -'IWC ■ Sense- . ". <Vm^: 




GGAAUGAGCUUCAGAUGCA 


1887 


GGACACAGCUGUAAGAUUG 


1888 


GACAAGAGAUUACAGAUAA 


1889 


GAAGAAACAUUCCCUAUUC 


1890 




'Sense 




GAGGGUGGUUUAUGAUUAA 


1891 


CAACAAAGAACAAUCACUA 


1892 


GAAGAAAUAUACACGGUUU 


1893 


UACAUGAGCUUUACAGAUA 


1894 






1 ^tr» . 


GGUAGGAGAUGAAUUGUUU 


1895 


AGAAGGAACUGAUCAUUAA 


1896 


CUACAGACCUCCAAUAUCA 


1897 


GAAACUGGCCAAACCGU UA 


1898 








GAUUAUCGCUGCAAAGGAA- 


1899 


GAGAGUGCCUGCAAGCUUD' 


1900 


GACAAGAGGUGGCCAUAAA 


1901 


UUAAAUCGCUGUCUUGAGA 


1902 



PAK4 







ACUAAGAGGUGAACAUGUA 


1903 


GAUCAUGAAUGUCCGAAGA 


1904 


GAUGAGACCCUACUACUGA 


1905 


CAGCAAAGGUGCCAAAGAU 


1906 



PAK6 







UAAAGGGAGUUGUCCACUA 


1907 


GAAGGGACCUGCUUUCUUG 


1908 


GCAAAGACGUCCCUAAGAG 


1909 


CCAAUGGGCUGGCUGCAAA 


1910 



PAK7 


Sense 






GAGCACGGCUUUAAUAAGU 


1911 i 



HDAC1 



HDAC2 



CREBBP 



BTRC 



RIPK2 



VAV1 



VAV2 



GRB2 



PLCG1 



CAAACUCCGUUAUGAUAUA 


1912 


GGAUAAAGUUGUCUGAUUU 


1913 


GGAAAUGCCUCCAUAAAUA 


1914 




Sense 




GGACAUCGCUGUGAAUUGG 


1915 


AGAAAGAAGUCACCGAAGA 


1916 


GGACAAGGCCACCCAAUGA 


1917 


CCACAGCGAUGACUACAUU 


1918 




Sense 




GCUGUUAAAUUAUGGCUUA 


1919 


GCAAAGAAAGCUAGAAUUG 


1920 


CAUCAGAGAGUCUUAUAUA 


1921 


CCAAUGAGUUGCCAUAUAA 


1922 




. '-^Sense ^ \ ~ -^lik-- \ 




GGCCAUAGCUUAAUUAAUC 


1923 


GCACAGCCGUUUACCAUGA 


1924 


GGACAGCCCUUUAGUCAAG 


1925 


GAACUGAUUCCUGAAAUAA 


1926 




>1Sense ^ ^: ^ ^ 




CACAUAAACUCGUAUCUUA 


1927 


GAGAAGGCACUCAAGUUUA 


1928 


AGACAUAGUUUACAGAGAA 


1929 


GCAGAGAGAUUUCAUAACU 


1930 




1 ?"~r»:t Sense iM^^% Mmkt^Jh 




GAACAUACCUGUAAAUCAU 


1931 


GGACAUCGACCUGUUAUUA 


1932 


UAAAUGAACUCCUACAUAG 


1933 


GGAAUUAUCUCUGAACAUA 


1934 




! \ ^ '^Sense ^ , 




GCAGAAAUACAUCUACUAA 


1935 


GCUAUGAGCUGUUCUUCAA 


1936 


CGACAAAGCUCUACUCAUC . _ 


1937 


GCUCAACCCUGGAGACAUU " 


1938 




: f^W^^ ^ ^ Sense. ''r^^m 




GGACAAGACUCGCAGAUUU 


1939 


GCUGAGCGCUUUGCAAUAA 


1940 


CAAGAAGUCUCACGGGAAA 


1941 


UCACAGAGGCCAAGAAAUU 


1942 




Sense, ; ':.xt'^■■ . 


■^'■ ■ ■WM^ . .. 


UGGAAGCCAUCGCCAAAUA 


1942 


CAUCAGUGCAUGACGUUUA 


1943 


UGAAUGAGCUGGUGGAUUA 


1944 


UGCCAAAACUUACCUAUAA 


1945 




Sense 




GAGCUGCACUCCAAUGAGA 


1946 


GAAACCAAGCCAUUAAUGA 


1947 


CCAAGGAGCUACUGACAUU 


1948 



AGAGAAACAUGGCCCAAUA 



1949 



ITGB1 



ITGA4 



I @©M§ 1 


II 


CCACAGACAUUUACAUUAA 


1950 


GAAGGGAGUUUGCUAAAUU 


1951 


GAACAGAUCUGAUGAAUGA . 


1952 


CAAGAGAGCUGAAGACUAU 


1953 








GCAUAUAUAUUCAGCAUUG 


1954 


CAACUUGACUGCAGUAUUG 


1955 


GAACUUAACUUUCCAUGUU 


1956 


GACAAGACCUGUAGUAAUU 


1957 






1 1 


AG AAAG AGCU UGACAG U AA 


1958 


GGAAGUAGUUCACAAAAUA 


1959 


UGAAGUAUCUGUAUCGAAA 


1960 


GAGCUUCACUCCCUUAGUU 


1961 







UAAGGACUCUGAAGAUGUA 


1962 


GACAAAGUGUGUAAUUAUG 


1963 


GCUCAGGAGUUAGCAAGAA 


1964 


GAAACUGAAUACCUAAGAU 


1965 


GAAACUGAAUACCUAAGAU 


1966 


UAAGGACUCUGAAGAUGUA 


1967 


GACAAAGUGUGUAAUUAUG 


1968 


GCUCAGGAGUUAGCAAGAA 


1969 



RALGDS 



PRKCA 







CCAUCCAGCUGAUCCAGAA 


1970 


GAACCCUCCUGAUGAGAGU 


1971 


GAGGACAUCCACCAGUACA 


1972 








GAUUAGAGACCAAGGAUUU 




CCACUGAUGUGUGUUAAUU 


1973 


CAAUAGAACCUGUCAAUAU 


1974 


GAAGACAGGAAUCGAAUGA 


1975 








GAUGUGAGUAGAAGAGUUA 


1975 


GGAAGAAUUUGUACCAUUU 


1976 


GAACGACCUUUCUUUCUUU 


1977 


GGAGUCAUCUCUUCCUAUA 


1978 








GGAGAAGCCUCACCUCUUG 


1979 


GCAG AAAGG ACUCAAG AU U 


1980 


GAGAACAACUACUCAUUGA 


1981 


GAACUUCUCGUCACUGUAU 


1982 




II @SGi@© 


1 1 


GGAUUGUUCUUUCUUCAUA 


1983 


GAAGGGUUCUCGUAUGUCA 


1984 



MAP2K4 



MAP2K7 



MAPK8 



MAPK9 



AIF1 



BBC3 



BCL2L1 



BCL2L1 1 



GAAGAAGGAUGUGGUGAUU ' 


1985 


GGACUGGGAUCGAACAACA • 


1986 






1 mam II 


II 


GGACAGAAG UGG AAAU AUU 


1987 


UCAAAGAGGUGAACAUUAA 


1988 


GACCAAAUCUCAGUUGUUU 


1989 


GGAGAAUGGUGCUGUUUAA 


1990 








GAAGAGACCAAAGUAUAAU 


1991 


GAAGACCGGCCACGUCAUU 


1992 


GGAAGAGACCAAAGUAUAA 


1993 


GCAUUGAGAUUGACCAGAA 


1994 


UGAGAGAACGAGAAAGUUG 


1995 


GUGAAACCCUGUCUGCAUU 


1996 


GGAUCUCUCUCAACAACUA 


1997 


ACAACUAGGUGAACACAUA 


1998 








UCACAGUCCUGAAACGAUA 


1999 


GAUUGGAGAUUCUACAUUC 


2000 


GCUCAUGGAUGCAAAUCUU 


2001 


GAAGCUAAGCCGACCAUUU 


2002 








AAAGAGAGCUUAUCGUGAA 


2003 


GAUGAUAGGUUAGAAAUAG 


2004 


ACAAAGAAGUCAUGGAUUG 


2005 


GGAGCUGGAUCAUGAAAGA 


2006 








GAAAAGGGAUGAUGGGAUU 


2007 


CCUAGACGAUCCCAAAUAU 


2008 


GAGCCAAACCAGGGAUUUA 


2009 


UGAAACGAAUGCUGGAGAA 


2010 


UCACUGACCGAGAGAAAUA 


2011 


CCAAGAAAGCUAUCUCUGA 


2012 


AGACUCACCUAGAGCUAAA 


2013 








GCUGGAGGGUCCUGUACAA 


2014 


GAGCAAAUGAGCCAAACGU 


2015 


GGAGGGUCCUGUACAAUCU 


2016 


GACUUUCUCUGCACCAUGU 


2017 








CCAGGGAGCUUGAAAGUUU 


2018 


AAAGUGCAGUUCAGUAAUA 


2019 


GAGAAUCACUAACCAGAGA . 


2020 


GAGCCCAUCCCUAUUAUAA 


2021 








GAGACGAGUUUAACGCUUA 


2022 


AAAGCAACCUUCUGAUGUA 


2023 


CCGAGAAGGUAGACAAUUG 


2024 



BID 



BIRC2 



BIRC3 



BIRC4 



BIRC5 



BRCA1 



CARD4 



CASP10 



GCAAAGCAACCUUCUGAUG 


2025 


AGACAGAGCCACAAGGUAA 


2026 


GCAAGGAGGUUAGAGAAAU 


2027 


CAAGGAGGUUAGAGAAAUA 


2028 


UCUUACGACUGUUACGUUA 


2029 




Sense 




GAAGACAUCAUCCGGAAUA 


2030 


CAACAGCGUUCCUAGAGAA 


2031 


GAAAUGGGAUGGACUGAAC 


2032 


ACGAUGAGCUGCAGACUGA 


2033 




Sense ' 




GAAAGAAGCCUGCAUAUAA 


2034 


GAAAUUGACUCUACAUUGU 


2035 


ACAAAUAGCACUUAGGUUA 


2036 


GAAUACACCUGUGGUUAAA 


2037 




Sense 




GGAGAUGCCUGCCAUUAAA 


2038 


UCAAUGAUCUUGUGUUAGA 


2039 


GAAAGAACAUGUAAAGUGU 


2040 


GAAGAAAGAACAUGUAAAG 


2041 




■ ^- \- . Sense''-' - . - ^ 




GUAGAUAGAUGGCAAUAUG 


2042 


GAGGAGGGCUAACUGAUUG 


2043 


G AGG AACCCUGCCAUG U AU 


2044 


GCACGGAUCUUUACUUUUG 


2045 




.-xr-^^^.^^-^'hif . ^''M Sense, ,i - t, *^y> 




GGCGUAAGAUGAUGGAUUU 


2046 


GCAAAGGAAACCAACAAUA 


2047 


GCACAAAGCCAU UCU AAG U 


2048 


CAAAGGAAACCAACAAUAA 


2049 








CCAUACAGCUUCAUAAAUA 


, - - 2050 


GAAGAGAACUUAUCUAGUG 


^ 2051 


GAAGUGGGCUCCAGUAUUA 


2052 


GCAAGAUGCUGAUUCAUUA 


2053 


CCAUACAGCUUCAUAAAUA 


2054 




' ' • Sense 




GAAAGUUAAUGUCAAGGAA 


2055 


GAGCAACACUGGCAUAACA 


2056 


UAACAGAGAUUUGCCUAAA 


2057 


GCGAAGAGCUGACCAAAUA 


2058 




Sense 




CAAAGGGUUUCUCUGUUUA 


2059 


GAAAUGACCUCCCUAAGUU 


2060 


GAAGGCAGCUGGUAUAUUC 


2061 


GACAUGAUCUUCCUUCUGA 


2062 


GCACUCUUCUGUUCCCUUA 


2063 



CASP2 



CDKN1A 



CFLAR 



CLK2 



CLSPN 



CSNK2A 
1 



CTNNB1 



CXCR4 



CXCR6 



Sense 




GUAUUAAACUCUCCUUUGA 


2064 


GCAAGGAGAUGUCUGAAUA 


2065 


CAACUUCCCUGAUCUUUAA 


2066 


GCUCAAAGAUGUAAUGUAG 


2067 




Sense , 




GAACAAGGAGUCAGACAUU 


2068 


AAACUAGGCGGUUGAAUGA 


2069 


GAUGGAACUUCGACUUUGU 


2070 


GUAAACAGAUGGCACUUUG 


2071 




' ' Sense ^" ^T- ^ ' ^ 




GAUGUGUCCUCAUUAAUUU 


2072 


GAAGAGAGAUACAAGAUGA 


2073 


GAGCAUACCUGAAGAGAGA 


2074 


GCUAUGAAGUCCAGAAAUU 


2075 




Sense , 




GUGAAUAUGUGAAAUAGUG 


2076 


AAAGCAUGCUAGAGUAUGA 


2077 


UUAAGAAUGUGGAGAAGUA 


2078 


GAUAACAAGCUGACACAUA 


2079 




' 'iim^ Sensa^%'^ f 


^-^^ ^^^'^ 


GGACGUAAUUGAUGAAGUA 


2080 


GCAGAUGGGUUCUUAAAUG 


2081 


CAAAUGAGGUUGAGGAAAU 


2082 


GGAAAUACCUGGAGGAUGA 


2083 




-v" : "Seris^? --m [^'M- 




GAUCCACGUUUCAAUGAUA 


2084 


GCAUUUAGGUGGAGACUUC 


2085 


GAUGUACGAUUAUAGUUUG 


2086 


UGAAUUAGAUCCACGUUUC 


2087 








GCACAAGAAUGGAUCACAA 


2088 


GCUGAAACAUGCAGUUGUA 


2089 


GUACGUACCAUGCAGAAUA 


2090 


GAACUUGCAUUGUGAUUGG 


2091 




' Sense-r ^ ^-i-l 


, y>^j: . % 


GAAGCAUGACGGACAAGUA 


2092 


GAACAUUCCAGAGCGUGUA 


2093 


GUUCUUAGUUGCUGUAUGU 


2094 


CAUCAUGGUUGGCCUUAUC 


2095 




Sensei^i^ ^t - - 




GGAACAAACUGGCAAAGCA 


2096 


GAUCAGAGCAGCAGUGAAA 


2097 


GGGCAAAACUGAAUUAUAA 


2098 


GAUCUCAGGUUCUCCUUGA 


2099 



DAXX I I Sense ~~[ 



GAS41 



GTSE1 



HDAC3 



HDAC5 



CUACAGAUCUCCAAUGAAA 


2100 


GCUACAAGCUGGAGAAUGA 


2101 


GGAAACAGCUAUGUGGAAA 


2102 


GGAGUUGGAUCUCUCAGAA 


2103 




Sense 




GUAGUAAGCUAAACUGAAA 


2104 


GACAAUAUGUUCAAGAGAA 


2105 


GACAACAUCUCGUCAGCUA 


2106 


UAUAUGAUGUGUCCAGUAA 


2107 




Sense 




CAAAGAAGCUCACUUACUG 


2108 


GAACAGCCCUAAAGUGGUU 


2109 


GAACAUGGAUGACCCUAAG 


2110 


GGGCAAAGCUAAAUCAAGU 


2111 




■ -f. -4%' Sense Ml^. --W- 




GG AAAGCGAUG UGGAG AU U 


2112 


CCAAGACCGUGGCCUAUUU 


2113 


AAAGCGAUGUGGAGAUUUA 


2114 


GUGAGGAGCUUCCCUAUAG 


2115 






GAAUUCCUCUUGUCGAAGU 2116 


GUUAUUAGCACCUUUAAGA 


2117 


GGAGGGAGGCCAUGACUUG 


2118 


CAGGAGAGCUCAAGAAUGG 


2119 


GGAUAUGGAUUUCAGUUAA 


2120 


GGAAGUCGGUGCCUUGGUU 


2121 


GGAAGGAGAGGACUGGUUU 


2122 



HEC 



HIST1H2 
AA 



LMNB1 



LMNB2 







GCAGAUACUUGCACGGUUU 


2123 


GAGUAGAACUAGAAUGUGA 


2124 


GCGAAUAAAUCAUGAAAGA 


2125 


GAAGAUGGAAUUAUGCAUA 


2126 




: iWL . Sense, - 




GGCAAUGCGUCUCGCGAUA 


2127 


GAUCCGCAAUGAUGAGGAA 


2128 


GCAAUGCGUCUCGCGAUAA 


2129 


GAGGAACUCAAUAAGCUUU 


2130 




. \4y.%'- - ' t;/Sensje# ^= .^sw. ^ - ^ . ^ 


AAUAGAAGCUGUGCAAUUA 


2131 


CAACUGACCUCAUCUGGAA 


2132 


GAAGGAAUCUGAUCUUAAU 


2133 


GGGAAGGGUUUCUCUAUUA 


2134 




Sense 




GGAGGUUCAUUGAGAAUUG 


2134 


GGCAAUAGCUCACCGUUUA 


2135 


CAAAUACGCUUAGCUGUGU 


2136 


GGAGAUCGCCUACAAGUUC 


2137 



MYB 



MYT1 



NFKBIB 



NFKBIA 



NFKBIE 



NUMA1 



NUP153 



OPA1 



PARVA 



PIK3CG 



Sense 




GCAGAAACACUCCAAUUUA 


2138 


GUAAAUACGUGAAUGCAUU 


2139 


GCACUGAACUUUUGAGAUA 


2140 


GAAGAACAGUCAUUUGAUG 


2141 




Sense ■ J- 




GAGGUGAGCUGUUAAAUCA 


2142 


GCAGGGUGAUUUCCUAAUA 


2143 


GGGAGAAGAUAUUUAAUUG 


2144 


CAACUUCUCUCCUGAACUU 


2145 




mi'< • ■.^mSferise- - 




GGACACGGCACUGCACUUG 


2146 


GCACUUGGCUGUGAUUCAU 


2148 


GAGACGAGGGCGAUGAAUA 


2149 


CAUGAACCCUUCCUGGAUU 


2150 




' ' ' : Sense ^ n^mk,. j 




GAACAUGGACUUGUAUAUU 


2151 


GAUGUGGGGUGAAAAGUUA 


2152 


GGACGAGAAAGAUCAUUGA 


2153 


AGGACGAGCUGCCCUAUGA 


2154 




Kte' . - . , Sense . - . . - 




GAAGGGAAGUUUCAGUAAC 


2156 


GGAAGGGAAGUUUCAGUAA 


2156 


GGAAACUGCUGCUGUGUAC 


2157 


GAACCAACCACUCAUGGAA 


2158 








GGGAACAGUUUGAAUAUAA 


2159 


GCAGUAGCGUGAAGCAGAA 


2160 


CGAGAAGGAUGCACAGAUA 


2161 


GCAAGAGGCUGAGAGGAAA 


2162 








GAAGACAAAUGAAAGCUAA 


2163 


GAUAAAGACUGCUGUUAGA 


2164 ' 


GAGGAGAGCUCUAAUAUUA 


2165 


GAGGAAGCCUGAUUAAAGA 


2166 




Mi' " \ Sense ; ^ 




GAAAGAGCAUGAUGACAUA 


2167 


GAGGAGAGCUCUAUUAUGU 


2168 


GAAACUGAAUGGAAGAAUA 


2169 


AAAGAAGGCUGUACCGUUA 


2170 




/ I'-Sense ' j ^ 




CUACAUGUCUUUGCUCUUA 


2171 


GCUAAGUCCUGUAAGAAUA 


2172 


CAAAGGCAAUGUACUGUUU 


2173 


GAACAAUGGUGGAUCGAAA 


2174 




Sense 




AAGUUCAGCUUCUCUAUUA 


2175 



144 



PRKDC 



RASA1 



GAAGAAAUCUCUGAUGGAU 


2176 


GAACACCUUUACUCUAUAA 


2177 


GCAUGGAGCUGGAGAACUA 


2178 








GAUGAAAGCUCUAAAGAUG 


2179 


GAAAGGAGGUUCUAAACUA 


2180 


GGAAGAAGCUCAUUUGAUU 


2181 


GCAAAGAGGUGGCAGUUAA 


2182 








GGAAGAAGAUCCACAUGAA 


2183 


GAACAUACUUUCAGAGCUU 


2184 


GAACAAUCUUUGCUGUAUA 


2185 


UAACAGAACUGCUUCAACA 


2186 



SLC9A1 



- *-.^^V Sense-. l^.r 




GAAGAGAUCCACACACAGU 


2187 


UCAAUGAGCUGCUGCACAU 


2188 


GAAGAUAGGUUUCCAUGUG 


2189 


GAAUUACCCUUCCUCAUCU 


2190 



TEGT 





CUACAGAGCUUCAGUGUGA 


2191 


G AAC AU AU U UG AUCG AAAG 


i 2192 


GAGCAAACCUAGAUAAGGA 


2193 


GCAUUGAUCUCUUCUUAGA 


2194 



TERT 



■^a^, , Sense 




GGAAGACAGUGGUGAACUU 


2195 


GCAAAGCAUUGGAAUCAGA 


2196 


GAGCUGACGUGGAAGAUGA 


2197 


GAACGGGCCUGGAACCAUA 


2198 



TNFRSF 
6 



TOPI 



TOP2A 



TOP3A 







GAUACUAACUGCUCUCAGA 


2199 


GAAAGAAUGGUGUCAAUGA 


2200 


UCAAUAAUGUCCCAUGUAA 


2201 


UCAUGAAUCUCCAACCUUA 


2202 


GAUGUUGACUUGAGUAAAU 


2203 






GAAAGGAAAUGACUAAUGA 


2204 


GAAGAAGGCUGUUCAGAGA 


2205 


GGAAGUAGCUACGUUCUUU 


2206 


GGACAUAAGUGGAAAGAAG 


2207 








GAAAGAGUCCAUCAGAUUU 


2208 


CAAACUACAUUGGCAUUUA 


2209 


AAACAGACAUGGAUGGAUA 


2210 


CGAAAGGAAUGGUUAACUA 


2211 




Sense | 


CCAGAAAUCUUCCACAGAA 2212 | 



GAAACUAUCUGGAUGUGUA 


2213 


CCACAAAGAUGGUAUCGUA 


2214 


GGAAAUGGCUGUGGUAACA 


2215 




' Sense - , ^ ['^ . 




GAGACAAGAUGAAGACUGU 


2216 


GCACAUGGGCUGCGUCUUU 


2217 


CC AG UGCGCU U C AAG AUG A 


2218 


GAACAUCUGCUUUGAGGUU 


2219 








GGUAUUGCCUUGUGAAUUU 


2220 


GCAGAACAAUUACGAAUAG 


2221 


GUACAUAGCUGUUUGAAAU 


i 2222 


GCUGUAAACUUGUAGCAUU 


2223 



In addition, to identifying functional siRNA against gene families or 
pathways, it is possible to design duplexes against genes known to be involved in 
5 specific diseases. For example when dealing with hxmian disorders associated with 
allergies, it will be beneficial to develop siRNA against a number of genes including 
but not limited to: 
the interleukin 4 receptor gene 

(SEQ. ID NO. 2224: UAGAGGUGCUCAUUCAUUU, 
10 SEQ. ID NO. 2225: GGUAUAAGCCUUUCCAAGA, 
SEQ. ID NO. 2225: ACACACAGCUGGAAGAAAU, 
SEQ. ID NO. 2226: UAACAGAGCUUCCUUAGGU), 

the Beta-arrestin-2 
15 (SEQ. ID NO. 2227: GGAUGAAGGAUGACGACUA, 
SEQ. ID NO. 2228: ACACCAACCUCAUUGAAUU, 
SEQ. ID NO. 2229: CGAACAAGAUGACCAGGUA, 
SEQ. JD NO. 2230: GAUGAAGGAUGACGACUAU, ), 

20 the interferon-gamma receptor 1 gfene 

(SEQ. ID NO. 2231: CAGCAUGGCUCUCCUCUUU, 
SEQ. ID NO. 2232: GUAAAGAACUAUGGUGUUA, 
SEQ. ID NO. 2233: GAAACUACCUGUUACAUUA, 
SEQ. ID NO. 2234: GAAGUGAGAUCCAGUAUAA), 

25 
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the matrix metalloproteinase MMP-9 
(SEQ. ID NO. 2235: GGAACCAGCUGUAUUUGUU, 
SEQ. ID NO. 2236: GUUGGAGUGUUUCUAAUAA, 
SEQ. ID NO. 2237: GCGCUGGGCUUAGAUCAUU, 
5 SEQ. ID NO. 2238: GGAGCCAGUUUGCCGGAUA), 

the Slcl lal (Nrampl) gene 

(SEQ. ID NO. 2239: CCAAUGGCCUGCUGAACAA, 
SEQ. ID NO. 2240: GGGCCUGGCUUCCUCAUGA, 
10 SEQ. ID NO. 2241 : GGGCAGAGCUCCACCAUGA, 
SEQ. ID NO. 2242: GCACGGCCAUUGCAUUCAA), 

SPINK5 

(SEQ. ID NO. 2243: CCAACUGCCUGUUCAAUAA, 
15 SEQ. ID NO. 2244: GGAUACAUGUGAUGAGUUU, 
SEQ. ID NO. 2245: GGACGAAUGUGCUGAGUAU, 
SEQ. ID NO. 2246: GAGCUUGUCUUAUUUGCUA,), 

the CYPl A2 gene 
20 (SEQ. ID NO. 2247: GAAAUGC]&GUGUCUUCGUA, 
SEQ. ID NO. 2248: GGACAGCACUUCCCUGAGA, 
SEQ. ID NO. 2249: GAAGACApCACCAUUCUGA, 
SEQ. ID NO. 2250: GGCCAGAGGUUGACCUUCA), 

25 thymosin-beta4Y 

(SEQ. ID NO. 2251: GGACAGGCCUGCGUUGUUU, 
SEQ. ID NO. 2252: GGAAAGAGGAAGCUCAUGA, 
SEQ. ID NO. 2253: GCAAACACGUUGGAUGAGU, 
SEQ. ID NO. 2254: GGACUAUGCUGCCCUUUUG, 

30 

activin A receptor IB 

(SEQ. ID NO. 2255: ACAAGACGCUCCAGGAUCU, 
SEQ. ID NO. 2254: GCAACAGGAUCGACUUGAG, 
SEQ. ID NO. 2255: GAAGCUGCGUCCCAACAUC, 



SEQ. ID NO. 2256: GCAUAGGCCUGUAAUCGUA, 
SEQ. ID NO. 2257: UCAGAGAGUUCGAGACAAA, 
SEQ. ID NO. 2258: UGCGAAAGGUUGUAUGUGA, 
SEQ. ID NO. 2259: GCAACAGGAUCGACUUGAG, 
5 SEQ. ID NO. 2260: GAAUAGCGUUGUGUGUUAU, 
SEQ. ID NO. 2261: UGAAUAGCGUUGUGUGUUA, 
SEQ. ID NO. 2262: GGGAUCAGUUUGUUGAAUA, 
SEQ. ID NO. 2263: GAGCCUGAAUCAUCGUUUA, ), 

10 ADAM33 

(SEQ. ID NO. 2264: GGAAGUACCUGGAACUGUA, 
SEQ. ID NO. 2265: GGACAGAGGGAACCAUUUA, 
SEQ. ID NO. 2266: GGUGAGAGGUAGCUCCUAA, 
SEQ. ID NO. 2267: AAAGACAGGUGGCCACUGA), 

15 

the TAP 1 gene 

(SEQ. ID NO. 2268: GAAAGAUGAUCAGCUAUUU, 
SEQ. ID NO. 2269: CAACAGAACCAGACAGGUA, 
SEQ. ID NO. 2270: UGAGAAAUGUUCAGAAUGU, 
20 SEQ. ID NO. 2271 : UACCUUCAGUG'GAAACUUA, 

COX-2 

(SEQ. ID NO. 2272: GAACGAAAGUAAAGAUGUU, 
SEQ. ID NO. 2273: GGACUUAUGGGUAAUGUUA, 
25 SEQ. ID NO. 2274: UGAAAGGACUUAUGGGUAA, 
SEQ. ID NO. 2275: GAUCAGAGUUCACUUUCUU), 

ADPRT 

(SEQ. ID NO. 2276: GGAAAGAUGUUAAGCAUUU, 
30 SEQ. ID NO. 2277: CAUGGGAGCUCUUGAAAUA, 
SEQ. ID NO. 2278: GAACAAGGAUGAAGUGAAG, 
SEQ. ID NO. 2279: UGAAGAAGCUCACAGUAAA, ), 

HDC 



(SEQ. ID NO. 2280: CAGCAGACCUUCAGUGUGA, 
SEQ. ID NO. 2281: GGAGAGAGAUGGUGGAUUA, 
SEQ. ID NO. 2282: GUACAGAGCUGGAGAUGAA, 
SEQ. ID NO. 2283: GAACGUCCCUUCAGUCUGU), 

5 

HnmT 

(SEQ. ID NO. 2284: CAAAUUCUCUCCAAAGUUC, 
SEQ. ID NO. 2285: GGAUAUAUCUGACUGCUUU, 
SEQ. ID NO. 2286: GAGCAGAGCUUGGGAAAGA, 
10 SEQ. ID NO. 2287: GAUAUGAGAUGUAGCAAAU), 

GATA-3 

(SEQ. ID NO. 2288: GAACUGCUUUCUUUCGUUU, 
SEQ. ID NO. 2289: GCAGUAUCAUGAAGCCUAA, 
15 SEQ. ID NO. 2290: GAAACUAGGUCUGAUAUUC, 
SEQ. ID NO. 2291: GUACAGCUCCGGACUCUUC), 

Gab2 

(SEQ. ID NO. 2292: GCACAACCAUUCUGAAGUU, 
20 SEQ. ID NO. 2293: GGACUUAGAUGCGCAGAUG, 
SEQ. ID NO. 2294: GAAGGUGGAUUCUAGGAAA, 
SEQ. ID NO. 22^5: GGACUAGCCCUGCUGUUUA), and 

STAT6 

25 (SEQ. ID NO. 2296: GAUAGAAACUCCUGCUAAU, 
SEQ. ID NO. 2297: GGACAUUUAUUCCCAGCUA, 
SEQ. ID NO. 2298: GGACAGAGCUACAGACCUA, 
SEQ. ID NO. 2299: GGAUGGCUCUCCACAGAUA). 

30 In addition, rationally designed siRNA or siRNA pools can be directed against 

genes involved in anemia, hemophila or hypercholesterolemia. Such genes would 
include, but are not be limited to: 
APOA5 

(SEQ. ID NO. 2300: GAAAGACAGCCUUGAGCAA, 
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SEQ. ID NO. 2301: GGACAGGGAGGCCACCAAA, 
SEQ. ID NO. 2302: GGACGAGGCUUGGGCUUUG, 
SEQ. ID NO. 2303: AGCAAGACCUCAACAAUAU), 



5 HMG-CoA reductase 

(SEQ. ID NO. 2304: GAAUGAAGCUUUGCCCUUU, 
SEQ. ID NO. 2305: GAACACAGUUUAGUGCUUU, 
SEQ. ID NO. 2306: UAUCAGAGCUCUUAAUGUU, 
SEQ. ID NO. 2307: UGAAGAAUGUCUACAGAUA), 

10 

NOS3 

(SEQ. ID NO. 2308: UGAAGCACCUGGAGAAUGA, 
SEQ. ID NO. 2309: CGGAACAGCACAAGAGUUA, 
SEQ. ID NO. 2310: GGAAGAAGACCUUUAAAGA, 
1 5 SEQ. ID NO. 2309: GC ACAAGAGUU AUAAGAUC), 

ARH 

(SEQ. ID NO. 2310: CGAUACAGCUUGGCACUUU, 
SEQ. ID NO. 2311: GAGAAGCGCUGCCCUGUGA, 
20 SEQ. ID NO. 23 12: GAAUCAUGCUGUUCI3CUUU, 
SEQ. ID NO. 2313: GGAGUAACCGGACACCUUA), 

CYP7A1 

(SEQ. ID NO. 2314: UAAGGUGACUCGAGUGUUU, 
25 SEQ. ID NO. 23 1 5: AAACGACACUUUCAUCAAA, 
SEQ. ID NO. 2316: GGACUCAAGUUAAAGUAUU, 
SEQ. ID NO. 2317: GUAAUGGACUCAAGUUAAA), 

FANCA 

30 (SEQ. ID NO. 2318: GGACAUCACUGCCCACUUC, 
SEQ. ID NO. 2319: AGAGGAAGAUGUUCACUUA, 
SEQ. ID NO. 2320: GAUCGUGGCUCUUCAGGAA, 
SEQ. ID NO. 2321: GGACAGAGGCAGAUAAGAA), 
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FANCG 

■ (SEQ. ID NO. 2322: GCACUAAGCAGCCUUCAUG, 
SEQ. ID NO. 2323: GCAAGCAGGUGCCUACAGA, 
SEQ. ID NO. 2324: GGAAUUAGAUGCUCCAUUG, 
5 SEQ. ID NO. 2325: GGACAUCUCUGCCAAAGUC), 

ALAS 

(SEQ. ID NO. 2326: CAAUAUGCCUGGAAACUAU, 
SEQ. ID NO. 2327: GGUUAAGACUCACCAGUUC, 
1 0 SEQ. ID NO. 2328: CAACAGGACUUUAGGUUCA, 
SEQ. ID NO. 2329: GCAUAAGAUUGACAUCAUC), 

PIGA 

(SEQ. ID NO. 2330: GAAAGAGGGCAUAAGGUUA, 
1 5 SEQ. ID NO. 233 1 : GGACUGAUCUUUAAACUAU, 
SEQ. ID NO. 2332: UCAAAUGGCUUACUUCAUC, 
SEQ. ID NO. 2333: UCUAAGAACUGAUGUCUAA), and 

factor VIII 

20 (SEQ. ID NO. 2334: GCAAAUAGAUCUCCATOAC, 

SEQ. ID NO. 2335: CCAGAUAUGUCGUUCUUUA, 

SEQ. ID NO. 2336: GAAAGGCUGUGCUCUCAAA, 

SEQ. ID NO. 2337: GGAGAAACCUGCAUGAAAG, 

SEQ. ID NO. 2338: CUUGAAGCCUCCUGAAUUA, 
25 SEQ. ID NO. 2339: GAGGAAGCAUCCAAAGAUU, 

SEQ. ID NO. 2340: GAUAGGAGAUACAAACUUU). 

Furthermore, rationally designed siRNA or siRNA pools can be directed 
against genes involved in disorders of the brain and nervous system. Such genes 
30 would include, but are not be limited to: 
APBBl 

(SEQ. ID NO. 2341 : CUACGUAGCUCGUGAUAAG, 
SEQ. ID NO. 2342: GCAGAGAUGUCCACACGUU, 
SEQ. ID NO. 2343: CAUGAGAUCUGCUCUAAGA, 



SEQ. ID NO. 2344: GGGCACCUCUGCUGUAUUG), 



BACEl 

(SEQ. ID NO. 2345: CCACAGAGCAAGUGAUUUA, 
5 SEQ. ID NO. 2346: GCAGAAAGGAGAUCAUUUA, 
SEQ. ID NO. 2347: GUAGCAAGAUCUUUACAUA, 
SEQ. ID NO. 2348: UGUCAGAGCUUGAUUAGAA), 

PSENl 

1 0 (SEQ. ID NO. 2349: GAGCUGACAUUGAAAUAUG, 

SEQ. ID NO. 2350: GUACAGCUAUUUCUCAUCA, 

SEQ. ID NO. 2351: GAGGUUAGGUGAAGUGGUU, 

SEQ. ID NO. 2352: GAAAGGGAGUCACAAGACA, 

SEQ. ID NO. 2353: GAACUGGAGUGGAGUAGGA, 
1 5 SEQ. ID NO. 2354: CAGCAGGCAUAUCUCAUUA, 

SEQ. ID NO. 2355: UCAAGUACCUCCCUGAAUG), 

PSEN2 

(SEQ. ID NO. 2356: GCUGGGAAGUGGCUUAAUA, 
20 SEQ. ID NO. 2357: CAUAUUCCCUGCCCUGAUAf 
SEQ. ID NO. 2358: GGGAAGUGCUCAAGACCUA, 
SEQ. ID NO. 2359: CAUAGAAAGUGACGUGUUA), 

MASSl 

25 (SEQ. ID NO. 2360: GGAAGGAGCUGUUAUGAGA, 
SEQ. ED NO. 2361: GAAAGGAGAAGCUAAAUUA, 
SEQ. ID NO. 2362: GGAGGAAGGUCAAGAUUUA, 
SEQ. ID NO. 2363: GGAAAUAGCUGAGAUAAUG, ), 

30 ARX 

(SEQ. ID NO. 2364: CCAGACGCCUGAUAUUGAA, 
SEQ. ID NO. 2365: CAGCACCACUCAAGACCAA, 
SEQ. ID NO. 2366: CGCCUGAUAUUGAAGUAAA, 
SEQ. ID NO. 2367: CAACAUCCACUCUCUCUUG) and 

152 



NNMT 

(SEQ. ID NO. 2368: GGGCAGUGCUCCAGUGGUA, 
SEQ. ID NO. 2369: GAAAGAGGCUGGCUACACA, 
5 SEQ. ID NO. 2370: GUACAGAAGUGAGACAUAA, 
SEQ. ID NO. 2371: GAGGUGAUCUCGCAAAGUU). 

In addition, rationally designed siRNA or siRNA pools can be directed against 
genes involved in hypertension and related disorders. Such genes would include, but 
1 0 are not be limited to : 

Jingiotensin II type 1 receptor 

(SEQ. ID NO. 2372: CAAGAAGCCUGCACCAUGU, 
SEQ. ID NO. 2373: GCACUUCACUACCAAAUGA, 
SEQ. ID NO. 2374: GCACUGGUCCCAAGUAGUA, 
1 5 SEQ. ID NO. 2375: CCAAAGGGCAGUAAAGUUU, 
SEQ. ID NO. 2376: GCUCAGAGGAGGUGUAUUU, 
SEQ. ID NO. 2377: GCACUUCACUACCAAAUGA, 
SEQ. ID NO. 2378: AAAGGGCAGUAAAGUUU), 

20 AGTR2 

(SEQ. ID NO. 2379: GAACAUCUCUGGCAACAAU, 
SEQ. ID NO. 2380: GGUGAUAUAUCUCAAAUUG, 
SEQ. ID-NO. 2381: GCAAGCAUCUUAUAUAGUU, 

SEQ. ID NO. 2382: GAACCAGUCUUUCAACUCA), and other related targets. 

25 

Example XIII: Validation of Multigene Knockout using Rab5 and Eps 

Two or more genes having similar, overlapping functions often leads to 
genetic redundancy. Mutations that knockout only one of, e.g., a pair of such genes 
(also referred to as homologs) results in little or no phenotype due to the fact that the 
30 remaining intact gene is capable of fulfilling the role of the disrupted counterpart. To 
fully understand the function of such genes in cellular physiology, it is often 
necessary to knockout or knockdown both homologs simultaneously. Unfortunately, 
concomitant knockdown of two or more genes is frequently difficult to achieve in 
higher organisms {e.g. mice) thus it is necessary to introduce new technologies dissect 
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gene function. One such approach to knocking down multiple genes simultaneously 
is by using siRNA, For example, Figure 11 showed that rationally designed siRNA 
directed against a number of genes involved in the clathrin-mediated endocytosis 
pathway resulted in significant levels of protein reduction (e.g. >80%). To determine 
5 the effects of gene knockdown on clathrin-related endocytosis, intemalization assays 
were performed using epidermal growth factor and transferrin. Specifically, mouse 
receptor-grade EGF (Collaborative Research Inc.) and iron-saturated human 
transferrin (Sigma) were iodinated as described previously (Jiang, X., Huang, P., 
Marusyk, A. & Sorkin, A. (2003) Mol Biol Cell 14, 858-70). HeLa cells grown in 12- 

10 well dishes were incubated with *^^I-EGF (1 ng/ml) or I-transferrin (1 )ag/ml) in 
binding medium (DMEM, 0.1% bovine serum albumin) at 37*^C, and the ratio of 
internalized and surface radioactivity was determined during 5-min time course to 
calculate specific intemalization rate constant kg as described previously (Jiang, X et 
al.). The measurements of the uptakes of radiolabeled transferrin and EGF were 

15 performed using short time-course £issays to avoid influence of the recycling on the 
uptake kinetics, and using low ligand concentration to avoid saturation of the clathrin- 
dependent pathway (for EGF Lund, K. A., Opresko, L. K., Strarbuck, C, Walsh, B. J. 
& Wiley, H. S. (1990) J. Biol. Chem. 265, 15713-13723). 

20 The effects of knocking down Rab5a, 5b, 5c, Eps, or Eps 15R (individually) 

are shown in Figure 22 and demonstrate that disruption of single genes has little or no 
effect on EGF or Tfh intemalization. In contrast, simultaneous knock down of Rab5a, 
5b, aridt5c,*or Eps and Eps 15R, leads to a distinct phenotype (note: total 
concentration of siRNA in these experiments remained constant with that in 

25 experiments in which a single siRNA was introduced, see Figure 23). These 

experiments demonstrate the effectiveness of using rationally designed siRNA to 
knockdown multiple genes and validates the utility of these reagents to override 
genetic redundancy. 

30 Example XIV. VaUdation of Multigene Targeting Using G6PD, GAPDH, PLK, 
and UQC. 

Further demonstration of the ability to knock down expression of multiple 
genes using rationally designed siRNA was performed using pools of siRNA directed 
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against four separate genes. To achieve this, siRNA were transfected into cells (total 
siRNA concentration of lOOnM) and assayed twenty-four hours later by B-DNA. 
Results shown in Figure 24 show that pools of rationally designed molecules are 
capable of simultaneously silencing four different genes. 

Example XV. Validation of Multigene Knockouts As Demonstrated by Gene 
Expression Profiling, a Prophetic Example 

To further demonstrate the ability to concomitantly knockdown the expression 
of multiple gene targets, single siRNA or siRNA pools directed against a collection of 
genes (e.g. 4, 8, 16, or 23 differeiit targets) are simultaneously transfected into cells 
and cultured for twenty- four hours. Subsequently, mRNA is harvested from treated 
(and untreated) cells and labeled with one of two fluorescent probes dyes (e.g. a red 
fluorescent probe for the treated cells, a green fluorescent probe for the control cells.). 
Equivalent amounts of labeled RNA from each sample is then mixed together and 
hybridized to sequences that have been linked to a solid support (e.g. a slide, "DNA 
CHIP"). Following hybridization, the slides are washed and analyzed to assess 
changes in the levels of target genes induced by siRNA. 

Example XVI. Identifying Hyperfunctional siRNA 

Identification of Hyperfunctional Bcl-2 siRNA 

The ten rationally designed Bcl2 siRNA (identified in Figure 13, 14) were 
tested to identify hyperpotent reagents. To accomplish this, each of the ten Bcl-2 
siRNA were individually transfected into cells at a 300pM (0.3nM) concentrations. 
Twenty-four hours later, transcript levels were assessed by B-DNA assays and 
compared with relevant controls. As shown in Figure 25, while the majority of Bcl-2 
siRNA failed to induce functional levels of silencing at this concentration, siRNA 1 
and 8 induced >80% silencing, and siRNA 6 exhibited greater than 90% silencing at 
this subnanomolar concentration. 

By way of prophetic examples, similar assays could be performed with any of 
the groups of rationally designed genes described in Example VII or Example VIII. 
Thus for instance, rationally designed siRNA sequences directed against 
PDGFA 
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(SEQ. ID NO. 2383: GGUAAGAUAUUGUGCUUUA, 
SEQ. ID NO. 2384: CCGCAAAUAUGCAGAAUUA, 
SEQ. ID NO. 2385: GGAUGUACAUGGCGUGUUA, 
SEQ. ID NO. 2386: GGUGAAGUUUGUAUGUUUA), or 

5 

PDGFB 

(SEQ. ID NO. 2387: GCUCCGCGCUUUCCGAUUU, 
SEQ. ID NO. 2388: GAGCAGGAAUGGUGAGAUG, 
SEQ. ID NO. 2389: GAACUUGGGAUAAGAGUGU, 
10 SEQ. ID NO. 2390: CCGAGGAGCUUUAUGAGAU, 
SEQ. ID NO. 2391: UUUAUGAGAUGCUGAGUGA) 

could be introduced into cells at increasingly limiting concentrations to determine 
whether any of the duplexes are hyperfunctional. Similarly, rationally designed 
sequences directed against 
15 HIFl alpha 

(SEQ. ID NO. 2392: GAAGGAACCUGAUGCUUUA, 
SEQ. ID NO. 2393: GCAUAUAUCUAGAAGGUAU, 
SEQ. ID NO. 2394: GAACAAAUACAUGGGAUUA, 
SEQ. ID NO. 2395: GGACACAGAUUUAGACUUG), or 

20 

VEGF 

(SEQ. ID NO. 2396: GAACGUACUUGCAGAUGUG, 
SEQ. ID N0..2391t<jAGAAAGCAUUUGUUUGUA, 
SEQ. ID NO. 2398: GGAGAAAGCAUUUGUUUGU, 
25 SEQ. ID NO. 2399: CGAGGCAGCUUGAGUUAAA) could be introduced into cells 
at increasingly limiting concentrations and screened for hyperfimctional duplexes. 

Example XVII; Gene Silencing; Prophetic Example 

Below is an example of how one might transfect a cell. 
30 a. Select a cell line. The selection of a cell line is usually determined by the 

desired application. The most important feature to RNAi is the level of 
expression of the gene of interest. It is highly reconmiended to use cell lines 
for which siRNA transfection conditions have been specified and validated. 
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b. Plate the cells. Approximately 24 hours prior to transfection, plate the cells at 
the appropriate density so that they will be approximately 70 - 90% confluent, 
or approximately 1x10^ cells/ml at the time of transfection. Cell densities 
that are too low may lead to toxicity due to excess exposure and uptake of 

5 transfection reagent-siRNA complexes. Cell densities that are too high may 

lead to low transfection efficiencies and little or no silencing. Incubate the 
cells ovemight. Standard incubation conditions for mammalian cells are 31^C 
in 5% CO2. Other cell types, such as insect cells, require different 
temperatures and CO2 concentrations that are readily ascertainable by persons 
10 skilled in the art. Use conditions appropriate for the cell type of interest. 

c. SiRNA re-suspension. Add 20 jil siRNA universal buffer to each siRNA to 
generate a final concentration of 50 jiM. 

15 d. SiRNA-lipid complex formation. Use RNase-firee solutions and tubes. Using 

the following table, Table XI: 
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e. 



Table XI 


96-weU 24-weU 


Mixture 1 (TransIT-TKO-Plasmid dilution mixture) 


Opti-MEM 


9.3 ^1 


46.5 Ml 


TransIT-TKO (1 \xg/\i\) 


0.5 [i\ 


2.5 Ml 








Mixture 1 Final Volume 


10.0 jil 


50.0 Ml 



Mixture 2 (siRNA dilution mixture) 


Opti-MEM 


9,0 Ml 


45.0 Ml 


siRNA(l mM) 


1.0 Ml 


5.0 Ml 


'llllxture 2;l|ijttal Vplum§|^^- 


■^^#0.0Ml^^* 


b; --><5oiiii . - 




Mixture 3 (siRNA-Transfection reagent mixture) 


Mixture 1 


10 Ml 


50 Ml 


Mixture 2 


10 Ml 


50 Ml 


Mixture.3 Final VolumlbV 


, ;*;20mi 






Incubate 20 minutes at room temperature. 


Mixture 4 (Media-siRNA/Transfection reagent mixture) 


Mixture 3 


20 Ml 


100 Ml 


Complete media 


80 Ml 


400 Ml 


-r Miitt^fFiiiSl^iffigi^ - 


'■^''WoMi^J* 






Incubate 48 hours at 37**C. 



5 Transfection . Create a Mixture 1 by combining the specified amotints of OPTI-MEM 
seram free media and transfection reagent in a sterile polystyrene tube. Create a 
Mixture 2 by combining specified amounts of each siRNA with OPTI-MEM media in 
sterile 1 ml tubes. Create a Mixture 3 by combining specified amounts of Mixture 1 
and Mixture 2. Mix gently (do not vortex) and incubate at room temperature for 20 

10 minutes. Create a Mixture 4 by combining specified amounts of Mixture 3 to 

complete media. Add appropriate volume to each cell culture well. Incubate cells 
with transfection reagent mixture for 24 - 72 hours at 37°C. This incubation time is 
flexible. The ratio of silencing will remain consistent at any point in the time period. 
Assay for gene silencing using an appropriate detection method such as RT-PCR, 

15 Western blot analysis, immunohistochemistry, phenotypic analysis, mass 
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spectrometry, fluorescence, radioactive decay, or any other method that is now known 
or that comes to be known to persons skilled in the art and that from reading this 
disclosure would useful with the present invention. The optimal window for 
observing a knockdown phenotype is related to the mRNA turnover of the gene of 
5 interest, although 24 — 72 hours is standard. Final Volume reflects amount needed in 
each well for the desired cell culture format. When adjusting volumes for a Stock 
Mix, an additional 10% should be used to accommodate variability in pipetting, etc. 
Duplicate or triplicate assays should be carried out when possible. 

10 While the invention has been described in connection with specific 

embodiments thereof, it will be understood that it is capable of further modifications 
and this application is intended to cover any variations, uses, or adaptations of the 
invention following, in general, the principles of the invention and including such 
departure from the present disclosure as come within known or customary practice 

1 5 within the art to which the invention pertains and as may be applied to the essential 
features hereinbefore set forth and as follows in the scope of the appended claims. 
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